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There is nothing more practical than a good 
theory. 


Foreword 


The field of circuit design for communications is a rapidly evolving area of research, 
with new technologies and applications emerging all the time. One of the challenges 
in this field is the need to accurately model and analyze the behavior of nonlinear 
systems. This book offers an approach to this problem, that is both more rigorous 
and intuitive than previously published. 

The book is organized into two main parts: signals and systems. The first part 
introduces the theory of distributions and covers topics such as basic properties of 
distributions, convolution, Fourier and Laplace transforms, and summable distribu- 
tions. The second part focuses on the application of distributions in the study of 
convolution equations and their solutions. 

In contrast to the few existing treatments, the approach taken highlights the alge- 
braic structure underlying weakly nonlinear systems and is based on distributions, 
rather than functions. The use of distributions leads naturally to the convolution 
algebras of Linear Time-Invariant (LTI) systems and the ones suitable for weakly 
nonlinear systems emerges as simple extensions to higher order distributions, without 
having to resort to ad hoc operators. The main advantages of the approach include 
a new justification for the validity of the Volterra series; with a much-simplified 
notation, free of multiple integrals. The net result being a conceptual simplification 
and the ability to solve the associated nonlinear differential equations in a purely 
algebraic way. 

Throughout the book, the author provides clear explanations of the key concepts 
and techniques, with numerous examples drawn from the area of circuits for wireless 
communications. As well as being of practical use to practitioners, these should help 
the reader to gain a deeper understanding of the material covered in the earlier part 
of the book. Of particular interest, to those interested in modern high-frequency 
circuit design, analysis of the classic amplifier cascode (common gate) shows the 
origin of nonlinear phenomenon, such as intermodulation, that can be accurately 
quantified from very simple models of the underlying transistors without having to 
resort to simulation. A similar analysis of local feedback in common source amplifiers 
(degeneration) shows similar rich behavior. As well as being theoretically interesting, 
this provides practical methods to the optimal design of such circuits. 


Vii 


viii Foreword 


This book is primarily intended for graduate students in engineering, who are 
interested in the theory of nonlinear systems and its application. However, it is likely 
that researchers in mathematics and physics will also find the material useful. 

I’m sure that this book will serve as a valuable resource for anyone working 
with nonlinear dynamical systems who wish to learn more about this interesting and 
relevant topic. 


Surrey, England Jon Strange 
April 2023 


Preface 


When I started working in industry I was immediately confronted with systems 
whose performance was invariably limited by noise and nonlinearities. For noise, 
there is a well-developed theory that can be used to guide the design and suggest 
ways to improve the system. For distortion, I was not aware of good theories and the 
investigations and developments were done entirely by numerical simulations. It’s 
not that I didn’t have interest in nonlinear systems. But rather than that, despite the 
fact that I graduated from a good university, the only courses offered on nonlinear 
systems were in the field of control theory and almost entirely devoted to stability 
questions. No course taught widely applicable methods to calculate the response of 
nonlinear systems. 

While working on trying to improve the linearity of a system, I read a paper 
making use of nonlinear transfer functions. Although I had heard the name “Volterra 
Series”, I did not know what it was, nor did anyone I knew. Stimulated by that paper I 
looked for a book, but didn’t find any in print. So, I looked in the second-hand market 
and found an out-of-print book mentioned in the paper. The book was very focused 
on practical applications, and didn’t say much about the broader theory. In any case, 
having learnt how to use nonlinear transfer functions, I started using them, and they 
immediately illuminated the reasons for some effects that I saw in simulations and 
that I didn’t fully grasp. Since then, I kept using that method very fruitfully. I also 
used nonlinear transfer functions in design reviews to try to pass on to colleagues 
the intuition that it gave me about nonlinear effects. 

After some time, some colleagues started to see the usefulness, and I was asked 
to prepare an internal tutorial on the subject. In preparing it, I realized how limited 
my theoretical understanding on the subject was and developed a desire for a much 
deeper understanding: I was hooked. I started studying it more by myself as well 
as through other (old) books, reports, and papers. While all studies that I saw did 
introduce some ad-hoc operators, I tried to develop a formulation using standard 
ones. When I realized that pairing convolution with tensor product would do, it 
became self-evident that the Volterra series could be seen as a generalization of the 
Taylor series. Convolution took front and center, but to make the mathematics solid, I 
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had to resort to Schwartz’s distributions. Differential equations became convolution 
equations and the Volterra series became a generalized formal power series. 

This book is a summary of my investigations in which I tried to develop the theory 
in a new and, I believe, simpler form. I included many examples. Some of them 
are short and serve to illustrate some points just discussed. Others are (condensed 
versions of) real-world applications where I try to illustrate the power of the theory. 

The book was written primarily for engineers. As is common in electrical engi- 
neering, I use the symbol j to denote the imaginary unit of complex numbers. This is 
to avoid confusing it with currents which are commonly denoted by i. All plots in the 
book were generated with the open-source CAS system maxima [1] which I also 
used to check many calculations and to compute all numerical solutions of differ- 
ential equations. All numerical simulations of nonlinear networks were performed 
with the open-source circuit simulator Xyce [2]. 

I would like to take the opportunity to thank some people that in a direct or 
indirect way have contributed to this book. First of all, I would like to thank my wife 
Alessandra for her constant encouragement and support during this project. I would 
also like to thank many colleagues at Analog Devices and Mediatek who shared their 
insights with me and stimulated me to go deeper. In particular, I would like to thank 
Jon Strange who, among other things, did put a lot of trust in me and involved me in 
many stimulating and future-looking projects. They were the stimulus that ultimately 
lead me to write this book. 


Alto Malcantone, Switzerland Federico Beffa 
February 2023 
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Chapter 1 A) 
Introduction E 


Nonlinear systems are everywhere, yet most engineering curricula devote very little 
time, if any, to them. The reason is twofold: first of all, there is no general theory 
describing arbitrary nonlinear systems. Second, the theory of linear systems is effec- 
tive in facilitating the design of many real-world systems. In fact, for sufficiently 
small input signals, the behaviour of most nonlinear systems can be approximated 
by a linear model. As a consequence the majority of engineered systems are designed 
based on linear system theory and their usability is limited in one way or another 
by the deviation of the real system from the assumed linear behaviour. This book 
is intended to give engineers a powerful tool to model, understand and reduce the 
impact of mild deviations from linear behaviour and thereby design better systems. 

This chapter tries to develop some intuition for what we call weakly-nonlinear 
systems. It also tries to give an idea of the theory to which this book is devoted and to 
its applicability. The chapter is not meant to introduce in a precise way any concept. 
In fact the exposition is rather informal. A proper systematic development of the 
theory will start with the next chapter. 


1.1 Nonlinear Phenomena 


The range of phenomena exhibited by nonlinear systems is much richer than the 
one of linear systems. To understand the applicability of the presented theory it’s 
useful to have an idea of the main ones that may appear. In the following we give a 
bird’s-eye view of them with qualitative descriptions. 


1.1.1 Multiple Equilibrium Points 


Most dynamical systems can be described by a system of differential equations that 
can be written in the form 


© The Author(s) 2024 1 
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Fig. 1.1 Pendumum in 
Earth’s gravitational field 


: f(u, x,t) 

Zu = uU, X, ’ 

dt i 

with u € R” the state of the system and x € R” the driving or input signal. For sim- 
plicity in this chapter we limit ourselves to autonomous systems. These are systems 
described by the simpler equation 


i = 1.1 
taf). (1.1) 


In the case of first and second order systems one can obtain a good qualitative 
understanding by examining the phase portrait of the system. This is a graphical 
representation of a family of state trajectories t +> u(t) for various initial conditions 
uo in the plane spanned by the components u; and wz of u that in this context is called 
the state or phase space. Note that the phase portrait can be sketched without having 
to solve the equation by considering the vector field defined by f in the state plane. 
With it, it’s easy to estimate the trajectory of the state u for every initial condition uo. 

Of special interest are the zeros of the vector function f. That’s because in those 
states the derivative with respect to time of the state vector u vanishes. In other words, 
the zeros of f are the equilibrium points of the system. A nonlinear function f in 
general has several equilibrium points and that’s a first fundamental difference from 
linear systems which always have only one equilibrium point. 

If f is well-behaved,! for every initial state uo, the system (1.1) has a unique solu- 
tion. This means that the trajectories in the phase plane do not intersect. Therefore, 
the trajectories can only begin or end at equilibrium points, at infinity or on limit 
cycles (see below). 

As an example consider the ideal friction-less pendulum shown in Fig. 1.1 and 
described by the differential equation 


q? p g 
ao nan, wo = 7 


' We will make this statement precise in a later chapter. 
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Fig. 1.2 Phase portrait of an 
ideal pendulum with wo = 1 


with @ the angle from the vertical, g the gravitational acceleration and / the length 
of the arm. The equation can be rewritten as 


d uy = U2 
dt \u2) — \—@p sin(u1) 


where we have set u; = ġ, u2 = dọ/dt. The phase portrait of this system is 
clearly periodic along the u axis. We can therefore limit the study to the range 
ui = [—x,2).? In this range the system has two equilibrium points: uoa = (0, 0) 
and uo, = (77, 0). 

The phase portrait of this system is depicted in Fig. 1.2 with the equilibrium 
points shown as black dots. The dashed lines connect the two equilibrium points and 
separates the phase plane in two distinct regions in which the system has different 
behaviour. The boundary between the two regions (the surface constituted by the 
dashed lines) is called the separatrix. Trajectories surrounding the equilibrium point 
uoa are closed curves that represent oscillations. The trajectories above and below 
the separatrix represent the pendulum perpetually rotating around the pivot. 

This shows a second fundamental difference from linear systems. Nonlinear sys- 
tems can exhibit different behaviour and characteristics in different regions of the 
phase space. 


2 Given that physically the angle @ and ø + 27 describe the same location, we should think of the 
phase space as a cylinder rather than a plane. 
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Fig. 1.3 a Electrical RLC oscillator with nonlinear feedback b Voltage-controlled current-source 
characteristic 


1.1.2 Limit Cycles 


A further phenomenon present in some nonlinear systems that doesn’t exist in linear 
ones is that of the limit cycles. These are periodic solutions of the equations at specific 
signal levels. As a simple example, consider the oscillator shown in Fig. 1.3a. It 
consists of a passive RLC resonator and a nonlinear saturating voltage-controlled 
current source (VCCS) with characteristic 


i(v) = Jp tanh (|) 


and plotted in Fig. 1.3b. The system is described by 


d? Wo d 2 
get Zh GmorR] ze + abv = 0. (1.2) 
with 
1 R 
Oy = -—: = —— 
g VLC 4 wL 


and the nonlinear transconductance 


d I 
Gm (v) = w” = 7 sech? (— ). 


If the maximum of |v(t)| over a full period 7 = 27 /a@p remains small compared to 
V, then the value of sech(v(t)/ Vç) remains very nearly 1 over a full cycle. Therefore, 
under this assumption, if G,,(0)R > 1 the coefficient of the first order derivative of 
v in (1.2) is negative and the (0, 0) equilibrium point of the equation is unstable. 
Differently from this, if the maximum of |v(t)| over a period is much larger than 
V, then the value of sech(u(t)/V;) approaches 0 for most part of a period. Hence, 
in this regime of operation the system is governed by an equation corresponding to 
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Fig. 1.4 Phase portrait of an 
electrical oscillator. 

wo = l rad/s, q = 4, R = 

1 Q, V; = 1 V, Io = 3V;/R 


the one of a damped oscillator. Between these two extreme cases there is a periodic 
trajectory, a limit cycle. On this trajectory the energy dissipated during one cycle by 
the resistor R is perfectly balanced by the energy injected in the resonator by the 
controlled source. This behaviour of the system is clearly discernible in the phase 
portrait shown in Fig. 1.4 in which we chose the current flowing through the inductor 
(downwards) iz and v as state variables. 

This example has a stable limit cycle, but there are systems with unstable limit 
cycles: any infinitesimally small deviation from the perfectly periodic trajectory leads 
to a trajectory diverging from the limit cycle. Limit cycles can also be stable on one 
side and unstable on the other one. 


1.1.3 Bifurcations 


All practical systems depend upon some parameters. For example the oscillator of the 
previous section depends on the value of the resistor R, and it is interesting to study 
how the value of that parameter affects the behaviour of the system. In particular 
the number and type of equilibrium points of a system may depend on the value 
of some parameter. This is in fact the case for our oscillator: For Gm(0)R < 1 the 
system has a single stable equilibrium point, while for Gm(0)R > 1 that equilibrium 
point becomes unstable and a limit cycle makes its appearance. Parameter values at 
which the character of the system behaviour changes are called critical or bifurcation 
points. 
As a second example, consider a system described by the differential equation 


d2 


JZ” =—hu+u?. 


Fig. 1.5 a Pitchfork 
bifurcation potential b 
Pitchfork bifurcation 
equilibrium points 
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The system can be interpreted as having a potential energy 


4 u? 


u 
Uxu) = 7AF 


2 


For à < 0 the potential energy has a single minimum at u = 0, while for A > 0 it has 


two minima as shown in Fig. 1.5a. In the latter case 
point and two new stable equilibrium points at 4 


u = Qis an unstable equilibrium 
EVA do appear. If we draw the 


equilibrium points of the system as a function of A one obtain the so-called pitchfork 


shown in Fig. 1.5b. 
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Fig. 1.6 Driven pendumum 
in Earth’s gravitational field 


yp(t) 


1.1.4 Chaos 


Consider the driven pendulum shown in Fig. 1.6. It is similar to the one of Sect. 1.1.1, 
with the difference that now the pivot moves in time in the vertical direction as 
described by the function yp. This movement introduces a driving term in the differ- 
ential equation that then becomes 


d? 2. sing d? 

qe” + œ sing = =r gave) 
Lets assume that the drive is periodic y,(t) = A cos(t). Figure 1.7 shows the time 
evolution of @ for two almost identical initial conditions. The upper curve was 
computed with the pendulum starting with (0) = Orad/s and at an angle of 
(0) = I rad. The lower curve was computed with almost identical initial conditions 
io (0) = Orad/s and ø (0) = 1 + 107!° /lrad. The lower curve was thus started with 
a displacement corresponding to approximately an atom diameter from the upper 
one. The evolution of the two is initially very similar. However, after some time 
they become completely different and uncorrelated. This extreme sensitivity to ini- 
tial conditions is the characteristic defining chaotic systems and makes long term 
predictions essentially impossible. In those systems the initial difference between 
adjacent trajectories grows on average exponentially [3]. 

In this simple case the phenomenon is intuitively understandable. When the pen- 
dulum reaches a position very close to the vertical, an infinitesimal difference in 
velocity can determine if it makes a full turn or if it goes back. 

Note that if the initial oscillation is sufficiently small, the force exercised by the 
vertical drive is almost orthogonal to the direction in which the mass is free to move. 
For this reason, small oscillations are not pushed to large swings and do not show 
chaotic behaviour. There are therefore regions of the phase space exhibiting chaotic 
behaviour and regions not exhibiting it. The areas of these regions depend of course 
on the amplitude A of the drive. Small values of A lead to large areas in which the 
system behaves predictably and only small areas displaying chaotic behaviour. 
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® (rad) 
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0 100 200 300 400 


Fig. 1.7 Time evolution of the driven pendulum with wọ = 1 rad/s, œ = 2wọ, g = 9, 8 m/s?, l = 


9.8m, A = 1/10. The upper curve was computed with initial conditions ¢ (0) = 1 rad, 40) = 


O rad/s, the lower one with initial conditions (0) = 1 + 10710 /lrad, (0) = Orad/s 


1.2 Weakly-Nonlinear Systems 


Chaos, bifurcations and other phenomena of nonlinear systems are fascinating, 
important and sometimes fundamental to the problem at hand. However, the vast 
majority of engineered system operate around stable equilibrium points by design. 
From an engineering point of view a quantitative theory to study the behaviour of 
nonlinear systems in the proximity of stable equilibrium points is therefore very 
important. 

Inspection of the presented phase portraits suggest that in the neighbourhood of 
equilibrium points the behaviour of nonlinear systems is not too different from the one 
of linear systems, the deviation increasing with increasing distance of the state from 
the equilibrium points. In fact this statement can be made more precise. Consider 
a time invariant, single-input single-output (SISO) system with input x whose state 
dynamics is governed by the system of first order differential equations 


“u(t =f(ult),x@), f:R"xROR 


and its output y by the algebraic equation 


y(t) = gult), x(t), 3g: R"xR-R. 
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If around the equilibrium point uọ = 0° and x = 0 the functions f and g are differ- 
entiable, then, using a Taylor expansion, the system behaviour can be approximated 
by the linear equations 


d 
ql x Ault) + Bs) AeR™, BeR™ 


and 
y(t) ~ Cult) + Dx), CeR'*", DeR. 


The response of the system to the input signal can then be expressed by a convolution 
integral between the impulse response h of the system and the input signal 


y(t) = h(t) * x(t). (1.3) 


Note that in this chapter by stable equilibrium point we mean one for which all 
eigenvalues of the linearized state equation are negative. 

The linear systems theory is very useful. However, for many practical applications 
this idealisation is too crude and doesn’t capture effects that limit the usability of 
a vast array of systems. The theory presented in this book enables one to solve the 
system equations when f and g are approximated by a higher order polynomial or 
even by power series. The theory therefore is able to give a more faithful description 
of the behaviour of many real systems. In particular, it allows probing into effects 
outside the reach of linear systems theory. 

Consider first a memory-less system, that is, a system whose output y(t) at time 
t depends only on the value of its input signal x(t) at time t and not on any of its 
past (or future) values. Such a system can be represented by a function g mapping 
for every value of t the value x(t) to y(t) 


y(t) = g(x(@)). 


Let’s assume that for a zero input signal the output is zero and that g can be expanded 
in a Taylor series. Then we can write 


[e0] 
yt) = Do aex*(). 
k=1 
Using the Dirac 6 distribution this expression can be written in the different form 


VA =J SCT o, T) ee Os, 


k=1 


3 By a change of variables it’s always possible to move the equilibrium point of interest to the origin. 
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That’s because, under assumptions to be made precise later, the 5 distribution is the 
unit of the convolution product 


b(t) * x(t) = x(t). 
The response of a linear memory-less system can therefore be written as 
y(t) = g15(T) * x(t) 


which shows a striking similarity with (1.3), the response of a linear dynamical system 
with impulse response A. In fact gô is the impulse response of a linear memory-less 
system, and it vanishes everywhere except at the origin as expected. 

From these considerations it’s natural to hypothesise that the response of a class of 
nonlinear systems, around a stable equilibrium point, can be represented by a series 
of the form 


OEDD CNE DEEM 


k=1 


This is in fact true, and it is the Volterra series representation of the system with Ax 
its kth order impulse response. This representation is valid only for sufficiently small 
input signals not pushing the state of the system beyond a separatrix. This limitation 
of the Volterra series should not surprise. In fact power series, which are a subset of 
the Volterra series, in general also have a finite convergence radius. 

The similarity between power series and the Volterra series doesn’t end here. By 
introducing suitable definitions, we can represent the cascade of nonlinear systems 
represented by their respective Volterra series in a similar way as the composition of 
power series. 

We call systems that can be represented by a Volterra series weakly-nonlinear 
systems. A feature of weakly-nonlinear systems shared with linear ones is the fact that 
the differential equations describing the system have to be solved only once to obtain 
the impulse responses. The response of the system to a large set of different input 
signals can then be computed directly from them. The impulse responses therefore 
completely characterise weakly-nonlinear systems. As with linear systems, weakly- 
nonlinear ones have a frequency domain representation in terms of nonlinear transfer 
functions. 

The theory can be extended to cover time-varying systems. In this case the impulse 
responses (or nonlinear transfer functions) have an explicit dependence on time 


Ak(t, Tlse.. Tk). 
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1.3 Distributions 


The Dirac ô distribution plays a key role in highlighting the relationship between 
power and Volterra series. An ad-hoc use of the ô distribution however easily leads 
to problems. 

Consider for example the Heaviside unit step function (or unit step function) 


0 t<0 
14) := eee (1.4) 


and the theorem stating that the Laplace transform of the derivative of a function f 
continuous for t > 0 is 
sF(s) — f(O+) 


with F the Laplace transform of f and f (0+) the right-hand side limit to 0 of the 
function. A careless application of this theorem to 1, gives 


d 1 
—1 =s--l= 
cg +} DF 0 


where we have used the fact that £ {14} = 1/s. However, we will show that the 
derivative of 1, is the ô impulse whose Laplace transform is 1. The error lies in the 
fact that 6 is not a function, but rather a Schwartz’s distribution, or distribution for 
short. The above theorem, in the stated form, is therefore not applicable. 

Distributions are the proper setting for studying linear and weakly-nonlinear sys- 
tems. In this setting the convolution product comes to play a central role. In fact, 
distributions allow defining convolution algebras with ô playing the role of the unit. 
The Laplace transform then not only maps convolution products into multiplications, 
but it also maps the unit of the convolution algebra into the unit of multiplication. In 
addition, in this setting, the derivative of a distribution f can be represented as the 
convolution of the distribution with the derivative of the unit 


of = “ef 

—f = —ôx* f. 

dt dt 
Differential equations can therefore be transformed into convolution equations to 
obtain a complete time-domain mirror image of the Laplace domain algebraic equa- 
tions. Distributions enable a uniform representation in terms of convolution products 


of ubiquitous and embarrassingly simple linear systems such as inductors, which a 
theory based on functions is unable to do 


LIS j 
= L—ô xi. 
v gon 


Here we see the current i as the input and the voltage v as the output of the system. 
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While we have been implicitly assuming causal systems and signals vanishing for 
t < 0, there are other convolution algebras. One of them is the convolution algebra of 
periodic distributions intimately related to the Fourier series, where the ô distribution 
plays a central role as well. 


1.4 Numerical Simulations 


Learning a theory requires some investment of time. The question is: is it worth it in 
a world full of computers and where numerical methods able to solve most nonlinear 
equations are readily available? In our view the answer is definitely a resounding yes. 
Numerical simulations and theory are not in competition, but rather they complement 
each other. 

The theory is able to reveal the origin of the various effects at play and to clarify 
how each parameter affects the performance of a system. However, to do so we must 
use relatively simple models and therefore, most of the time, obtain approximate 
results. 

Simulations on the other hand can be used to obtain accurate answers taking into 
account all details. However, the results are presented as tables of numbers (or curves) 
valid only for a specific set of values of the parameters. We can of course run many 
simulations and sweep parameters, but complex simulations are not fast and this 
poses practical limits. In addition, inferring the relationship between parameters and 
a specific effect from simulation results only is often challenging. We can say that 
a good theory based engineering model is like a (slightly distorted) picture, while 
numerical simulations are like dots of a halftone image. A good model is worth 
thousands of simulations. 

Most of the time the difficulty in engineering problems lies in finding the simplest 
model able to correctly characterise the effects of interest. During the phase of model 
development, numerical simulators can be extremely useful by using them as ideal 
laboratories in which to validate hypotheses. In these virtual laboratories it’s easy 
to change the laws of physics and suppress or decouple phenomena in a way that’s 
impossible in the real world. Experiments conducted in these virtual laboratories 
can therefore be an invaluable guide in the development of a model. Once a good 
model has been found, it will rapidly guide the development of the system. Numerical 
simulations will then serve further for final tuning and verification. 


1.5 Historical Notes 


Around 1887 Vito Volterra developed the concept of functionals as an extension of 
functions of multiple variables to ones with an uncountable infinite number [4, 5]. 
Let f(x1,...,X%) be areal valued function of the k real variables x; to xg. The latter 
can be interpreted as the values of a function x, evaluated at the discrete points 1 to 
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k. He therefore conceived a functional as a function of another function x defined 
over a continuous finite interval. He then proposed the series now bearing his name 
as an extension of the Taylor series from functions to functionals. 

In 1910 the mathematician M. Fréchet published a more detailed analysis of the 
conditions under which a functional can be expanded into a Volterra series [6]. This 
work is regarded by most as the foundation demonstrating the validity of Volterra’s 
series expansion. 

Volterra was well known and was invited to present his works in several countries, 
including the United States. During the second World War there was great pressure 
to develop anti-aircraft systems and N. Wiener of the Massachusetts Institute of 
Technology (MIT) found that by using Volterra’s series he could analyse the response 
of a nonlinear device to noise [7]. His report was initially restricted. Its release 
after the war sparked interest in the engineering community at MIT and elsewhere. 
Several studies followed in the 50s to the 70s applying Volterra’s series to nonlinear 
engineering problems, with [8—10] among the most significant ones. Wiener himself 
remained interested in the subject and developed his own variant of the theory based 
on Browning motion and leading to what’s now called the Wiener series [11]. At the 
beginning of the 80s some books summarised the Volterra and Wiener theories [12, 
13]. At around the same time, with the raise of desktop computers, engineering efforts 
started more and more to embrace numerical methods. 

During the first decades of the 20th century there were two mathematical methods 
used by engineers and physicists that kept mathematicians occupied. The first was to 
find a solid mathematical justification for the operational calculus popularised by O. 
Heaviside. The second was the search for a solid mathematical interpretation for the 
6 distribution and its derivatives extensively used by P. A. M. Dirac in his landmark 
treatise on quantum mechanics which first appeared in 1930 [14]. 

The former was solved in two ways: 


(i) With the help of the Laplace- and Fourier-transforms, highlighting the frequency 
domain aspects, and 
(ii) by Mikusinski [15] using purely algebraic methods. 


The second was solved by L. Schwartz by introducing new mathematical objects 
called distributions [16]. These are a special class of functionals with particularly 
attractive properties such at the fact that they are indefinitely differentiable. In addi- 
tion, differentiation of distributions is a linear operation making series always differ- 
entiable term by term. With distributions Schwartz not only did put the ô “function” 
and its derivatives on a solid ground, but he also introduced convolution algebras and 
unified the two justifications for the operational calculus. 

A deep understanding of distributions requires familiarity with advanced con- 
cepts of topological vector spaces [16, 17], which is probably why they are rarely 
introduced to engineers. However, the elementary part of the theory can be devel- 
oped without recurring to particularly deep mathematical concepts and is of great 
practical value in physics and engineering problems. The aim of this book is to intro- 
duce distributions to engineers and use them to view the Volterra series and, more 
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generally, weakly-nonlinear systems from a point of view different from the tradi- 
tional one. The advantages are, among others, a conceptual simplification, a simpler 
notation freeing expressions from multiple integrals and an exposition of the theory 
of weakly-nonlinear systems as a natural extension of the linear one. 
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Part I 
Signals 


Chapter 2 A) 
Distributions E 


We investigate the mathematical description of signals that are commonly used in 
the analysis of technical problems. From a mathematical point of view it would be 
useful to limit the signals of interest to the set of continuous functions. However, 
this has several disadvantages. For example, suppose we are interested in the tran- 
sient response of, say, a series RC low-pass-filter (LPF) to an input step voltage. 
If we describe the input signal with a continuous function, then the details of the 
calculations depend on the chosen description of the input signal rise transient. This 
however tends to mask the fact that, if the LPF time constant is much larger than 
the input signal rise-time, the output response is essentially independent of the input 
signal transient shape. For this reason, in these situations, it is much more convenient 
to use an idealized input unit step function such as the Heaviside unit step function 


0t<0 
CE 


which is not continuous at t = 0. 

Consider further the LPF example. If we write the differential equation using the 
current as unknown, then we need the derivative of the driving signal. However, the 
derivative of the function 1, does not exist at t = 0 and is zero at every other point. 
We are therefore led to introduce the so-called Dirac impulse ô which however is not 
even a function, but rather a generalized function or distribution. 

It follows from these considerations that a correct description of commonly used 
signals belongs to the theory of distributions. Distributions have many useful prop- 
erties. The key one being that they can be differentiated any number of times. The 
main contributor to the development of this theory was Schwartz [16]. 
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2.1 Test Functions 


The key idea in the theory of distributions, is not to direct attention to the value of 
a function at every point of its domain, but instead to “measure” the behavior of a 
function when acting on a class of particularly well-behaved functions. In this section 
we introduce one such class of functions, the class of test functions. 

Let k = (ki, .. . , kn) be an n-tuple of non-negative integers called a multi-index. 
The differential operator of order |k| is defined by 


Dies (2.1) 


with 
|k| := ky ++ +k, (2.2) 


the length of the multi-index k and t € R”. For functions of a single variable we also 
use the following shorter notation for the kth order derivative 


(2.3) 


where in this case k is of course a single non-negative integer. 
Given an open set U C R”, the set of all k-times continuously differentiable 
functions f : U —> C is denoted by C*(U) or simply C*. 


Definition 2.1 (Test function) A function ¢ : R” — C is called a test function if 
it is indefinitely differentiable and has compact support, that is, if ø € C% and 
(T1, .--, Tn) = 0 outside a compact set K. The vector space of all such functions 
is denoted by D. 


To define a continuity criterion for distributions we need to define a topology that 
can be encoded in the form of a convergence principle in D. 


Definition 2.2 (Convergence of test functions) A sequence of functions m € 
D, m €E Nis said to converge to @ € D, in symbols 


m—> oo 


Pin ae (o) or lim bn =O, 


if the following two conditions are met: 


1. There exist a compact set K such that it includes the support of all ¢,, and of @. 
2. For every n-tuple k the sequence of functions D‘¢,, converges uniformly toward 
D*¢. 


These conditions ensure that the limiting function (i) has compact support and (ii) 
that it is indefinitely differentiable, in other words, that the limiting function is also 
a test function. 
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Fig. 2.1 Example test 2 
function 
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1 L 
0.5 F 
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Example 2.1: Test function 


Consider the following functions (see Fig. 2.1) 
B,(t) := U oo) for |vt| < 1 (2.4) 


for |vt| > 1 
B =f ert d 


For each value of v > 0 and for |vt| < 1 the function £, is the composition of 
a rational function with no singularities and the exponential function. Since the 
latter two functions are indefinitely differentiable and the composition of indefinitely 
differentiable functions is also indefinitely differentiable, it follows that, in this range, 
B, is indefinitely differentiable. 

To establish that £, is a test function we have further to show that 


lim D*B,(t) =0 (2.5) 


|vt|t 


for all values of k. This can be done by induction: assume that the kth order derivative 
is the product of 6, and a polynomial in the two variables tı = 1/(1 — vt) and 
tm = 1/(1 + vt) 


DŽ B, (t) = pe ( ) B®). (2.6) 


l1— vt 1+vt 


This is clearly the case for k = 0. We show that this is then true for k + 1: 


20 2 Distributions 


D! B, (t) 
_ [Ome a 
Gey" 2 wt+1? 


1 1 1 1 
v (DIPA (T T) UPR ae) P 
(vt — 1)" 2(vt—1)? 


1 1 
= Pk+1 (—. <=) b(t). (2.7) 


If we express the limit as vt tends to 1 in terms of t1, we see that it is the limit of the 
product of a polynomial and a decreasing exponential which converges to 0 


: ; Ti voo i 
lim D* y(t) = lim q, ——)—e ™I=0. 2.8 
lim D*O = lim p zp ge (2.8) 


Similarly, the limit towards —1 can be expressed in terms of t2 with the same result. 
Hence f, € C”. 

While £, is a test function for each value of v, the sequence (Bm), m € N doesn’t 
converge in D. For t 4 0, m — œ the value of m(t) converges toward zero, while 
the value of the functions at t = 0 grows without bounds. The limiting function is 
therefore not continuous. 

The sequence (1/m) also doesn’t converge in D. As m — oo the support of the 
functions grows without bounds. It is therefore not possible to find a compact set K 
containing the support of all members of the sequence as well as that of the limiting 
function. 

An example of a converging sequence is Bm /m” which converges toward the zero 
function. 


Example 2.2: Regularisation 


Consider an impulse of finite duration (see Fig. 2.2a) 


10<t<k 
0 otherwise 


10) = 140) — 14- k) = | 


This function is clearly not continuous att = 0 and t = k. These jump discontinuities 
can be removed by convolving 1, with the function £, of the previous example 


oo k 
tex pin = f te) B= nae = | By(t—t)dt. (2.9) 
oo 0 


We say that the so obtained function is the regularised of 1; by p, (see Fig. 2.2a). 
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Fig. 2.2 a Regularized of the discontinuous function 12(.) b Construction of the regularized of 


12(.) 


Observe that 1, * 6, is just a definite integral of £, and is therefore indefinitely 
differentiable. If the support of 6, lies completely within the integration range, then, 
given the chosen normalization constant for 6,,, the value of 1, * 8, is 1. If the support 
of f, doesn’t intersect the integration range, then the value of 1, * 6, is 0. For the 
remaining values of the independent variable t, 0 < 1, * B,(t) < 1 (see Fig. 2.2b) 


1 l/v<t<k-I1/v 
1, * By(t) = 4 0 t<—l/vort>k+l1/v (2.10) 
> O and < 1 otherwise. 


We have thus established that 1; x 6, € D. 
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From this example we see that for any open interval U and any closed interval 
K C U we can construct a real valued test function @ with 0 < (t) < 1, a value 
of 1 within K and a value of 0 outside of U. This is a useful property that we will 
exploit later. 

A similar construction can be made for test functions of more than one variable. 
For later reference we define a real valued test function with values between 0 and 1 
that we call œ such that 


L |r| <1 


Olio. (2.11) 


a: R” = [0, 1], r| 


2.2 Distributions 


A key aspect of the theory of distributions is the fact that it makes continuous functions 
differentiable any number of times. To see how this goes, remember from calculus 
that by partial integration we can transfer the operation of differentiation from one 
function to another one. Thus, if we pair the function of interest f with a function @ 
differentiable everywhere, then we can relate the derivative of f with a well-defined 
expression 


/ Df (t) b(t) dt = f(t) OI -f f(t) De(t) dt. (2.12) 


To make the expression independent of the limits of integration, the first term on the 
right-hand side should disappear. This can be achieved, for example, by choosing a 
function @ with compact support. In addition, to be able to assign a meaning to the 
derivative of any order, the function @ should be indefinitely differentiable. Note that 
these are precisely the properties of test functions. 

An additional requirement is that of the assignment being unique. For example, 
the right-hand side expression should be identically zero only if Df = 0 (almost 
everywhere). Suppose that f has compact support. If the support of Do doesn’t 
overlap with the one of f then the right-hand expression is also zero and the assign- 
ment is not unique. To avoid this situation we are forced to pair the function f with 
every test function @ € D. 

A distribution is a generalization of these ideas and is defined as follows. 


Definition 2.3 (Distribution) A distribution is defined as a linear, continuous func- 
tion on the set of test functions 


T: DR") >C, oh (T,¢) (2.13) 
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This means that a distribution T has the following properties: 


1. (T, ġ1 + 2) = (T, 1) + (T, 2) for all ġ1, @2 € D. 
2. (T,c¢) =c(T, >) forall € Dandc €C. 
3. From ġk ar ¢ it follows that (T, z) —> (T, ¢), where the latter is the normal 


convergence of complex numbers. 


Since distributions are linear by definition, the condition of continuity can be 
expressed in a slightly different, but equivalent way: 


3’. From $y a 0 it follows that (T, ø} —> 0. 


Two distributions T; and T, are equal if (Ti, 6) = (T, $) for every test function 
$ € D. A distribution is called real if it evaluates to a real number when applied to 
any real valued test function. 

The set of all distributions forms a vector space denoted by D’, where addition of 
two distributions Tı and T) and multiplication with a complex constant c are defined 
by 


(Ti + Th, @) := (Ti, ) + (Ta, $) 
(cT, b) := c(T, $) = (T, c ġ). 
A mapping assigning a number to every element of a vector space is called a func- 
tional. Distributions are therefore functionals on test functions. 
Example 2.3: Functions as distributions 


Consider a continuous function f € C(R"). We can associate with it a distribution 
Tp by the procedure outlined at the beginning of the section 


(Ty, 6) = Í. f(t) (rt) d"r. (2.14) 


Linearity is clear from the properties of integrals. To see that it is continuous, consider 
a sequence of test functions converging to zero @m D 0. Then 


| f (0) @m(t) d"t < sup nO f |f(t)|d"t — 0 
R” teK K 


with K a compact set including the support of all ¢,,. 

Consider now two continuous functions fı and f2. If (Th, 6) = (Th, $) for every 
$ € D, then, by the properties of integrals of continuous functions, it follows that 
fı = fo. We thus have an injective mapping from continuous functions to distri- 
butions. We can therefore identify continuous functions with their corresponding 
distributions and write (f, ø) instead of (Ty, $). 
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The theory of distributions requires the use of Lebesgue integrals as opposed 
to Riemann ones, as Lebesgue’s integration theory is more powerful and allows 
integrating a broader set of functions. Of course, when both integrals do exist, they 
coincide. A key concept in the Lebesgue theory of integration is that of the measure. 
For our purposes we can think of the Lebesgue measure as a volume and a set of zero 
measure in R” as a (sufficiently regular) subspace of dimension k < n. A point on 
the real line R, a line on a plane and a surface in R? are all examples of sets of zero 
measure. The union of a denumerable family of sets of zero measure is itself a set 
of zero measure. Therefore, the set of rational numbers on the real line R has zero 
measure. Two locally integrable functions differing only on a set of zero measure are 
said to be equal almost everywhere. 


Example 2.4: Locally integrable functions 


Consider a locally integrable function f € £}, (R"), a function that is Lebesgue 
integrable over every compact set K C R”. As in the previous example we can 
associate it with a distribution through the integral (2.14). In this case however the 
mapping is not injective. Any two locally integrable functions fı and fz differing 
only in a set of measure zero produce the same value (f1, 6) = (f2, 6) for every 
ġ € D. That means that they map to the same distribution. 

In physical and engineering applications the values of a function in a set of zero 
measure is often unimportant. It is therefore natural to consider the equivalence class 
of all functions differing at most on a set of zero measure. In this way we obtain 
again an injective mapping, but now from the equivalence class of locally integrable 
functions differing at most on a set of zero measure (equal almost everywhere) to 
distributions, and we can again identify without ambiguity the former with the latter. 
To avoid overloading the notation we write a representative for the equivalence class, 
that is, we write (f, @) where f is a representative. 


All distributions that can be represented by locally integrable functions through 
(2.14) are called regular distributions. However, not all distributions are regular 
and distributions that aren’t regular are called singular distributions. Nonetheless, 
regular distributions are dense in D’. That is, in a similar way as real numbers arise 
as a limiting process from rational ones, any distribution can be represented as a limit 
of regular distributions, where the convergence of distributions is defined as follows. 


Definition 2.4 (Convergence of distributions) A sequence of distributions (Tin) men 
is said to converge to the distribution T , if the sequence of numbers (Tm, @) converges 
to the number (T, ¢) for every ¢ € D. In symbols 


Tn — T or lim 7, =T 
D m—>oo 
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if 
(Im, @) — (T, ġ) foreveryọo Ee D. 


Itis not obvious that the limit T is in fact a distribution, that is, linear and continuous. 
However, this is indeed the case. The space D’ is thus closed under convergence. A 
proof can be found in [18]. 

This definition is based on a discrete parameter m traversing the natural numbers. 
If the parameter traverses a continuous set of values the situation is similar and can 
be reduced to the discrete case. Consider the sequence of distributions T, depending 
on the continuous parameter v € R. For each value of v and each test function ¢, 
the functional (T,, ¢@) evaluates to a number. For each test function ¢ the set of 
distributions T, therefore defines a function of v 


tw) = (T,, Q). 


Lets define a sequence (vm)men Of values converging toward infinity. If for every 
such sequence and every test function 


jim ¢ (vg) = lim (7,6) = (T, $) 


then 
lim (T,, $) = (T, $). 


Similarly for a continuous parameter converging toward a finite limit n. 


Example 2.5: Dirac delta distribution 


Consider the functions m of Example 2.1. The regular distributions associated with 
these functions form a sequence converging to a singular distribution. We have 


Limyn—00(Bm» $) 
= limaran Bn(t) b(t) dt 


1/m 1/m 


= limmo [Sn Bn (2) (0) de + Sm Pa b(t) — HOla} 


Since test functions are continuous and differentiable we can use the mean value 
theorem to express ¢ as 


(t) = (0) + DoA)t 


for some A € (0, t). With this we can see that the second term converges to zero 
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1/m 1/m 
Í, Pm (T) lØ) — (0)] dt <f lm) 16(t) — 6(O)| dt 


/m 
IDA i” 
= sup > aS |Bm(T)| dt 
re(—4,4) m —1/m 
— 0 


We therefore obtain 
1/m 
Lim (Ans) = (0) lim f Badr 
m—>0o mo J ijm 
= ¢(0) lim 1 
m—>œ 
= ġ (0). 


The sequence 8,,, thus converge to the Dirac delta distribution 6 which is defined by 
(8, d) := o (0). (2.15) 


Besides the sequence m there are many other regular distribution sequences 
converging to 6. For example, with the same procedure used above, it is simple to 
show that the sequence defined by the following functions does also converge to ô 


_ f m/2 || <1/m 
masi, t| > 1/m 


Note that the notation used in many technical texts to define the Dirac delta 
distribution is not mathematically correct and only has a symbolic value 


f d(t) p(T)dt = (0). 


This notation imply the existence of a function with a value of zero everywhere but 
at t = 0 where its value is infinite. However, the value of the Lebesgue integral of 
such a function is zero since a single point of the real line has zero measure. This 
notation is however useful as it helps to remember several properties that we will see 
shortly. 


Example 2.6: Cauchy principal value 


The function f(t) = 1/t is not locally integrable. For this reason we can’t associate 
with it a regular distribution through (2.14). A way around this is to use the Cauchy 
principal value of the integral to define the following singular distribution 
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© oO G 


nial 8 “$0, ref % OD) ar (2.16) 
«10 


Integrating by parts the first integral we obtain 


rts} 


PO ar = = ((t) In|t])|_ s-f In|t| Dé(t) dt 


—=00 


= ġ(—e€) ln |e| — T In|t| Dé(t) dt 


and similarly for the second integral 


i ee) dt = —$(©) In(e) — [mo Do(t)dt. 


€ 


We see that the first term of both integrals do diverge as € goes to 0. However, using 
the mean value theorem, we note that there are values à; € (0, €) and Az € (—e, 0) 
such that 


P(e) = $0) +D) 
b(—€) = 6(0) — e Do A2) 


With M = — (D¢(A;) + Do (à2)), the limit of the sum of the diverging parts there- 
fore do cancel 


lim Me Inle| = 0 
«10 


and we finally obtain 


1 CO 
(pv a?) =- f In|t| Do(t) dt. 


This last integral is well defined as In |t| is locally integrable and therefore defines a 
well defined regular distribution. We will meet this distribution again in the context 
of the Fourier transform of distributions. 
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2.3 Basic Properties 


There are some useful operations that we can perform on locally integrable functions 
that can be carried over to distributions. A common operation is to shift a function f 
by an amount t to obtain t +> f(t — t). If we apply the change of variable à = t — t 
to the regular distribution associated with the shifted function we obtain 


[o0] [o0] 
[ te-vewa=f fogata. 
—oo —0o 
By generalizing this result we define the operation of shifting a distribution by 


(T(t = T), $) := (TM), t + 7). (2.17) 


With this definition we can for example denote a Dirac pulse at time t by ô(t — t) 


(S(t — T), b(t) := b(t). 


! Notation 


Note that a distribution T isn’t a function of the variable t. In spite of this it is useful 
to write T (t) to indicate the symbol used for the independent variable of the testing 
function (this will be useful when we’ll introduce operations such as the convolution) 
and as a convenient notation to indicate some operations such as shifting. In no way 
this is meant to imply the existence of a function or that the distribution is regular. 


Another useful operation is multiplication of the independent variable of a function 
by a constant a. By generalizing what happens with regular distributions, we define 
multiplication of the independent variable by a constant a for any distribution in 


D’(R") by 
1 
-ġ (+) k (2.18) 
jal a 


This operation is closely related to the concepts of even and odd distributions. 


(Tad, 9O) = (TO, 


Definition 2.5 (Even and odd distributions) An even distribution T is defined as a 
distribution for which, for every test function @ 


(T(t), pt) = (TE), 6). (2.19) 


Similarly, an odd distribution satisfies 
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(T(t), @(—t)) = =T), @@) (2.20) 


for every test function ¢. 


A further useful operation is multiplication of a distribution with an indefinitely 
differentiable function y. First note that multiplication of a test function @ with an 
indefinitely differentiable function results in another test function. For this reason 
we can again generalize the behavior of regular distributions and define 


(y T, $) := (T, y $). (2.21) 


2.4 Differentiation of Distributions 


At the beginning of Sect. 2.2 we mentioned that one of the distinguishing features 
of distributions is the fact that they can be differentiated any number of times. We 
also argued that, for regular distributions, partial integration leads to an expression 
which can be considered as the definition of the derivative of regular distributions 


(Ff, 6) = | "d 


= -f f(t) (rt) dt 
= (f,-6™). (2.22) 


In fact, this definition can be extended to singular distributions and to distributions 
of several variables, that is to arbitrary distributions. 


Definition 2.6 The first order partial derivative of a distribution T on D(R”) is 
defined by 
(DiT, ġ) := (T, —Diġ) i=1,...,n. (2.23) 


Since the derivative of a test function D;¢ is still a test function, it follows that 
the derivative of a distribution is always a distribution and that distributions can be 
differentiated an arbitrary number of times. 

With k an n-tuple of non-negative integers, the derivative of order |k| follows from 
the above definition 

(D‘T, $) = (-DM(T, D'4). (2.24) 


The order of differentiation is irrelevant since test functions have continuous partial 
derivatives of all orders and hence 


(D;D;T, ¢) = (T, Dj Did) = (T, DiDi) = (D; DiT, >) . 
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The rule for the differentiation of the product of a distribution T and an indefinitely 
differentiable function y is the same as the rule of differentiation for the product of 
two functions 


(Di(yT), 6) = —(yT, Did) = —(T, y Did) 
= —(T, Dily $)) + (T, Diy $) 
= (D;T, y $) + (Diy T, $) 
= {y DiT, o) + (Diy T, $) 
or 
Di(yT) = y DiT + (Diy)T. (2.25) 


Two important properties of distributional differentiation follow immediately 
from the definition. The first is that differentiation is a linear operation: given two 
distributions T) and T> and two numbers cı and c2 


D* (ci Ti + © Tr) = c DT +c DN. (2.26) 
The second is continuity: given a sequence of distributions (Tm)men converging 


toward a distribution T , the sequence of corresponding partial derivatives (D*Tn)meN 
converges to DT 


lim (D*Tn, ¢) = lim (1)! (Tn, D*ġ) 


m—> oo 


(—D(7, D‘¢) 
= (D‘T,¢). (2.27) 


In other words, the operations of limit-taking and differentiation can always be 
exchanged. In particular this means that if a sequence of partial sums Sm = S2”"9' T; 
converges toa series S = }°°° 7;, then the series can be differentiated term by term. 


! Notation 


To distinguish a regular distribution defined by the usual derivative of a function f 
from the derivative in the sense of distributions of the regular distribution defined 
by f, we are going to always denote the former by Ty or Tp p. The later will be 


denoted interchangeably by f®, D* f, T® or D*Tș. 


Example 2.7: Derivative of ô 


The first order derivative of the Dirac delta distribution is 


(6, p) = — (8, p?) = —6 (0). 
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The kth order one is 


(6, 6) = 1} 0). 


This example shows that, in general, to calculate the value (T, @) of a distribution 
T when applied to a test function @, it’s not enough to know the values of ¢ over 
supp(T). We need to know the values of ¢ over a neighborhood of supp(T). 


Example 2.8: Derivative of 1, 


The derivative of the Heaviside unit step 14 as a function is zero everywhere but 

at t = 0 (a set of zero measure) where it is undefined. The function 19 is therefore 

locally integrable, and we can define the regular distribution 7a) which evaluates to 
+ 


zero for every test function œ. 
Differently from this, the derivative of 1, as a distribution is defined everywhere 


and, applying the definition, we find 
12,8) = 14,0) == [6% adr = -0@IF = 00) = 6.4). 


that is 
1P = 6, 


Example 2.9: Function versus distributional derivative 


Consider a function f : R —> C continuously differentiable k times everywhere but 
at t = 0, where it has a discontinuity such that both limits 


‘ (i) i (i) 
an f~ (t) and im frm 
exist for all i < k. Let’s denote the difference between these limits by 
i = lim f” (t) — li O). 
ai = lim fO (f) — lim fOW) 
Then we may represent the function f as 
FO) = feo(t) + 00140) 


with f-o a continuous function. It is easy to see that for t Æ 0, iG (t) = fO). 
Thus, using the results of Example 2.8 the first order derivative of f is 
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ToP = Tro + ad. 


To compute the second-order derivative we can use the same procedure. We 
decompose the function f® into a continuous function f- ı and a step 


FOO = fer +0140. 
Differentiating term by term we therefore obtain 
T® = TU + a6 
= T + aT + apd 
= Ta + ad + ad? 
= Tra + ad + ad"? 


The kth order derivative can be obtained by iterating this procedure 


T® = Tjo +95“) + 54 H aad. (2.28) 


Example 2.10: Logarithm derivative 
In Example 2.6 we showed that the Cauchy principal value of 1/t is a distribution 
1 [e6] 
(pv =.) =- f mi Doar 
T —0o 


We now recognize this result as saying 


1 
py- = Dlin|t|. 
T 


Example 2.11: Limit to oo of trigonometric functions 
Consider the following parameterized distribution 


cos wt 
w >Q. 


folt) m 


As @ tends to infinity, it converges to 
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COS wt 
/ = p(t) dt 
supp(ġ) w 


lim |(fo,)| = lim 


w—> 
t 

< lim KAGON )| dt 

o> J supp)  @ 

t 

aim sup |(t)| K 

w—> o0 w 
=0 


with K = supp(@). Since distributions can always be differentiated and for distri- 
butions arising from continuous functions the derivative as a distribution coincides 
with the derivative as a function, we have the following result 


; 3 : cos wt 
lim (sin wt, ¢) = lim (—D Q) 
w—> w—> 0 w 
. _coswt 
= lim ( , Do) 
w—> 
=0 
or 
lim sin œt = 0. (2.29) 
w@—> o 
Similarly one obtains 
lim coswt = 0 and (2.30) 
w- 
lim e7” = 0. (2.31) 
w—> 


Note that these limits do not exist for the corresponding functions. 


2.5 Distributions with Compact Support 


The property of multiplication of a distribution with an indefinitely differentiable 
function suggests another interesting generalization. Let’s start again with a regular 
distribution f. Then, if we write out explicitly the integral of the distribution y f 


v.o)= f f(t) y(t) or) dt 
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we see that in principle we could group the functions differently and write (@ f, y). 
This is however not a distribution in D’ since y is not a test function as its support 
is not compact. In spite of this, the number (¢ f, y) is the same as (y f, @) for every 
test function @ and every function y € C. A moment’s reflection reveals that what 
makes these two expressions have the same value for every value of y is the fact that, 
as a function, @ f has compact support. 

To generalize this observation to arbitrary distributions we must first define what 
the support of a distribution T is. 

A distribution is said to vanish on an open set U € R” if (T, ¢) = 0 for all test 
functions @ with supp(@) C U, where supp(¢) is the support of the test function @. 


Definition 2.7 (Support of a distribution) The support of a distribution T is the 
complement of the largest open set U on which the distribution vanishes and is 
denoted by supp(T). 


The set of all distributions with compact support is denoted by &’ and forms a 
vector subspace of D’, that is 6’ C D. 


Example 2.12: Support of ô 


The value of the Dirac delta distribution ô applied to any test function @ with 
supp(¢) € U = (—o, 0) U (0, œœ) is zero. That is, ô vanishes on U. Its support 
is supp(6) = R \ U = {0} and is therefore compact. 


With the notion of the support of a distribution we can generalize our observation 
that (¢ f, y) = (y f, ) by saying that distributions with compact support T € 6’ 
can be extended to continuous, linear functionals L on indefinitely differentiable 
functions with arbitrary support. In this context the vector space of all indefinitely 
differentiable functions is denoted by & and, to give a meaning to the continuity of 
the functionals, it is equipped with the following convergence criteria. 


Definition 2.8 (Convergence in &) A sequence (Ym)nen € E is said to converge to 
y if for every compact subset K of R” and every n-tuple k, the set of functions D‘ yn 
converges uniformly to D‘ y 


sup DE Yn — D*y —> 0, m —> oœ. 
xeK 


Assume that the support of T is the compact set K and let œ be a test function 
equal to | in a neighborhood U of K. Then for every function y € & and for every 
point t E€ U, a(t) y(t) = y(t). Therefore, there is a functional L such that 


(Lyy)= (Tay) yeg. (2.32) 
That it is independent of the choice of a is easily verified: Suppose that a and a are 


two test functions equal to | in a neighborhood of K. Then in the smallest of these 
neighborhoods a, — a2 = 0 and (T, a, y) = (T, &2 y). 
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The functional thus defined is unique since, for every sequence of test functions 
Œm equal to 1 for |t| < m we have: on the one hand, by continuity of L 


lim (L, my) = (L, y) 


m> 


and on the other hand, for sufficiently large m 


(T, Amy) = (L, y). 


Therefore every distribution with compact support T defines a unique continuous, 
linear functional L on &. 

The converse is also true: Every continuous, linear functional L restricted to 
D C & defines a distribution T with compact support. For, if this was not the case 
and the support of T was not compact, then we could find a sequence of test functions 
Ọm E D with support in the complement of |t| < m, such that (T, m) = 1 for all 
m. However, since in & liMm—oo @m = 0, by continuity of L 


lim (L, n) =0. 


m> 


Therefore if (L, ¢) = (T, p) for all @ € D, then the support of T must be compact. 

There are other vector sub-spaces of D' which can be extended to larger sets 
of functions than D. We will encounter another one in the context of the Fourier 
transform. The set of test functions D are the common set on which all distributions 
are defined. 


2.5.1 Single-Point Support 


We now investigate distributions satisfying the following equation 
tT=0, (2.33) 
that is, distributions for which, for every k > 1 and test function @ 
(T, $) = (T,t' $) =0. 
For simplicity we limit ourselves to the one dimensional case. 
First observe that on the open set U = (—oo, 0) U (0, 00) the function t bh tk 


doesn’t assume the zero value. For this reason, to satisfy the equation, 7 must vanish 
on U, or, stated in other words, the support of T must be the origin: supp(T) = {0}. 
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Since the support of T is compact (a single point) and, for any test function ¢, 
the value of (T, @) is determined by the values of @ in a neighborhood (however 
small) of supp(7), we can expand ¢ using Taylor’s formula with remainder [19]. For 
our purposes it is convenient to express the remainder in integral form which can be 
obtained by integrating by parts multiple times 


b(t) = 60) + f $a) dt 
0 
= $0) — -DP (t—1) 6° (r) dt 


= $0) +t - +f e Y adr 


> dO m, fi G-de 


a Ma C-D Be) ar. 


m=0 


By performing the substitution t = fA the remainder can be transformed in the 
following form 


ae ae k=l IX _ tk 
f E-D (taA)drA = qp YO 


which makes it apparent that it is proportional to the product of t* and an indefi- 
nitely differentiable function y € &. Note that, differently from $, no addend has 
compact support. This poses no problem since T, having itself compact support, can 
be extended uniquely to a distribution on & (see Sect. 2.5). 

With this expansion we can express the value of (T, ġ) as a finite sum. Taking 
into account (2.33) 


+ (0) 
m=0 m! 

1 o™ (0) 
m! 


m=0 
k-1 


yao) 


m=0 


1 
(T, t”) + AT at YO) 


(T, $) = ET 


(T, t” ) 


or 


k-1 
T= a Cmô ™ (2.34) 
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with 


T (T, i”) 
Cm = (-1) ml : 


We have therefore established that the homogeneous equation 
*T =0 
has an infinity of non-trivial solutions, each being a weighted sum of the Dirac delta 
distribution 6 and its derivatives up to order k — 1. In addition, this shows that 6 and 


its derivatives are the only distributions with the support consisting of a single point. 


Example 2.13: Solutions of t T = 1 
We want to find all solutions of the equation 
tT=1. 
If T would be a function then the equation would have no solution at t = 0 and 1/t 


at all other points. From this we guess that the solution as a distribution could be 
T = pv1/t. Indeed, this distribution satisfies the equation 


lovio) = (ove) 
v—,~) =(pv-, 
p t P t 
1 
=li —to(t)dt 
om J wei p(t) 
oo 
= f p(t) dt 
—0o 
= (1, ¢) 
However, this is not the only solution as the homogeneous equation has non-trivial 
solutions given by (2.34) (k = 1). The equation is therefore satisfied by all distribu- 
tions of the form i 
T= PY +cô(t) 


with c an arbitrary constant. 
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Table 2.1 Main properties and operations on distributions 


2 Distributions 


(T, Dn Cm m) 
(Eri Cm Tm, o) 
(T@-— r), pt) 

(T (at), d(t)) 

(yT, p) 

(D‘T, ¢) 

lity 00(D* Tm, Ø) 
(DEO in Cm Tm), Q) 
(Sco D"8, $) 
(SOT, ¢) 
(Sx*T,ġ) 


ae Cm(T, dm) 

ea Cm(Tn, Q) 

(T(t), p+r) 

(T(t), are (4)) 

(T, y $) 

(CD(T, D‘6) 

(D‘T, $) 

ae Cm(D* Tn, @) 

0 

(S, (T, 6)) = (T, (S, $)) 
(S(t) ® TA), p(t +A)) 


The properties of distributions discussed in this chapter and some that will be 
discussed in the following ones are summarised in Table 2.1. 
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Chapter 3 A) 
Convolution of Distributions E 


The convolution product plays a central role in the description of linear and weakly- 
nonlinear systems. In this chapter we develop its theory based on distributions. In 
addition, the chapter introduces the tensor product which will also come to play an 
important role in the description of weakly-nonlinear systems. 


3.1 Tensor Product 


The general notion of convolution of distributions is defined in terms of the ten- 
sor product. We therefore start by defining this product and, as before, we start by 
considering regular distributions. 

The tensor product is a bilinear operation that can be used to generate a vector 
space out of other vector spaces. If f is a function on R” and g a function on R”, 
then the tensor product of f and g is defined as the function 


fOg:R”™ >C (Tt,å) > fga). 


The tensor product of two locally integrable functions is itself locally integrable. 
Therefore, if we now assume f and g to be locally integrable, we can try to build 
the tensor product of the regular distributions Ty and T, based on the tensor product 
f Q g. If we use as test function the tensor product of two suitable test functions £ 
and y then we obtain 


(Ty 8 Tz, § @ Y) = (Ty, €) (Ts, W) 


which is well-defined. However, for an arbitrary test function ¢ € D(R™*") it is 
not immediately apparent that the result is a distribution. Taking m = n = 1 for 
simplicity, we have 
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f(t) g(a) b(t, A) dadt 


8 —— 


(oe) 


f(t) f g(A) P(t, A) dardt. 


[o0] 
—0o 
[o0] 
—00 —0o 
The inner integral evaluates to a number for every value of the variable t, that is, 
it is a complex valued function of t that we call ¢(t). Furthermore, the variable t 
only appears as an argument of the test function ø. Therefore ¢ must have compact 
support. In addition, when computing its derivative, differentiation can be moved 
under the integral and ¢ is therefore indefinitely differentiable. In other words ¢ is a 
test function. We therefore have 


(Ty 8 Tz, 6) = (Tp, (Tg, $)) = (Tg, (Tr, $)) 


where the last equality comes from the fact that we could reverse the order of integra- 
tion without changing the result. This last property is referred to as Fubini’s theorem. 

The above arguments can be generalized to arbitrary distributions. That the inner 
functional is a function of t and that has compact support is clear. The fact that it 
can be differentiated comes from the continuity and linearity of distributions 


s(t te) — iC) 
€ 


(TA) OC +€,))) — (TA), 6, 4) 


D g(t) = lim 


= lim 
e>0 € 
-aro Eoen, 
= (T (à), Di (t, à)}). (3.1) 


With this we see that ¢ can be differentiated an arbitrary number of times and is thus 
a test function. We therefore obtain the following general definition for the tensor 
product of distributions. 


Definition 3.1 (Tensor product) Given two distributions S € D’(R”) and T € 
D' (R”) the tensor product S Q T is the distribution in D' (R”+”) defined by 


(S@T, @) := (S, (T, )) = (T, (S, $)). (3.2) 
It’s easy to see that the tensor product of distributions is bilinear 


(S+T)@U=S@U+TOU 


(3.3) 
S@(T+U)=S@T+SOU, 
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41 
and associative 
(S&T)QU=SQ8(T8QU). (3.4) 
As a useful abbreviation of notation we define the tensor power by 
T*:=TQ...QT, k>0 
— 
k times (3.5) 


TS :=1eC. 


Example 3.1: Higher Dimensional Dirac Pulse 


The tensor product of two Dirac pulses is 


(8 @ ô(T, A), b(t, A)) 


(ê CT), (5(A), (T, A))) = (8C), OCT, 0)) 
(0, 0) 
=: (ô(T, A), (T, A)). 


3.2 Convolution of Distributions 


We now come to the main objective of this section: the convolution of distributions. 
Remember that the convolution of integrable functions f, g € L! is defined as follows 


f * g(t) := i f() g(t —t)dt. 


To obtain a distribution we may write 
œo wo 

(f * 8,6) = f / f(t) g(t — t) dt b(t) dt 
oO — 00 


= | | tmsmoatnara 


which can be represented as the following tensor product 


(f * 8,0) = (F(T) @ 8), pA +T). 


However, while indefinitely differentiable, the function w(t, à) = @(A + T) is nota 
test function because its support is not compact. In fact, y(t, A) assumes the same 
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Fig. 3.1 Support of 
Wr, à) = br +A) 


supp(¢) 


supp(¢) 


value @(t) for every point on the diagonal line t = à + t of the (t, A)-plane (see 
Fig. 3.1). In spite of this, on account of our assumption that f and g are integrable 
functions (and not merely locally integrable), the above integral is well-defined. 
We therefore conclude that, similarly to the case of functions, the convolution of 
distributions only exists for a subset of distributions with additional characteristics. 


Definition 3.2 (Convolution) Given two distributions S and T in D’(R"), if for every 
test function @ € D(R") the tensor product S ®@ T can be extended to functions of 
the form y(t, à) = @(t +A), then the convolution product S x T is defined by 


(Sx T, @) = (S(t) @ TA), OC + A)) (3.6) 


and is commutative 
S*xT=TxS. (3.7) 


A sufficient condition for the existence of the convolution is as follows: if the 
intersection of the support of S @ T, that is supp(S @ T) = supp(S) x supp(T) and 
the support of y(t, A) = ġ (t + A) is bounded, then S x T is well defined. In other 
words, if for t € supp(S) and A € supp(T) the sum t + A can only remain bounded 
if both t and A remain bounded, then the convolution product S$ x T is well defined. 

Note that this condition is sufficient but not necessary as shown for instance by 
the introductory example with integrable functions f, g € L'. In fact the convolution 
f * g of integrable functions does always exist and is itself an integrable function 


Kf x8 p) = f(T) 8A) GA +7) dt dr 


i 
3 — e 


< sup |¢| f Faldt f pUJ: 
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Fig. 3.2 Support of 
S(t) ® T(A) and of 


supp(S(r) 8 T(A)) 
w(t, A) = b(t +A) i 


P(t) 
supp(¢) 


supp(¢) 
supply) —= ` 


Example 3.2: One Sided Distributions 


A subset of the real line U € R is said to be bounded on the left if there is a real 
constant b such that U C (b, œo). Similarly, a subset U is called bounded on the right 
if there is a constant b such that U C (—ooọ, b). 

Distributions whose support is bounded on the left (right) are called right-sided 

(left-sided) distributions. The set of all such distributions forms a vector space 
denoted by D'r (D'L). Of particular interest for our purposes are right-sided distri- 
butions T with supp(T) € [0, oo). We denote the space of all such distributions by 
D. 

Tiie 3.2 shows the support of S(t) T (à) and of Y (t, A) = ġ (t + A) for two 
distributions S and T in D'_. It is clear that, for any test function @, their overlap is 
always bounded. Therefore, the convolution of right-sided or left-sided distributions 
is always well defined. Not so the convolution of a left-sided distribution with a 
right-sided one. 


Example 3.3: Convolution with ô 


Let T be any distribution in D'(R”) and ô the n dimensional Dirac pulse (see 
Example 3.1), then 


(T =ô, $) = (T(t) 8 ê), b(t +A) 
= (T(t), (8), í + A))) 
= (T, ġ$) 
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or 
Txô=T. (3.8) 


Thus, ô is a unit of convolution. 
Similarly, for any |k|th order derivative of the Dirac pulse 


T(t) ® D‘8(A), p(t +A) 

T(t), (D‘5(A), ot +A))) 

T(r), (8A), (CDF Dip (t + A))) 
T(t), (8A), (—1)"!DEb(t +. d))) 
T(t), (—1)"'D*$(z)) 

D‘T (1), b(t)) 


(T x D'S, p) 


={ 
=( 
= 
= ( 
= ( 
= ( 


or 
T x D's = D‘T (3.9) 


where Dk and DŁ mean differentiation with respect to the variable à and t, respec- 
tively; and we made use of the fact that Dk b(t +A) = DE b(t +). 


In our notation we use T (t) to indicate a distribution to be associated with a test 
function whose independent variable is indicated by the symbol t. With this notation 
it seems natural to write S(t) x T(t) to denote the convolution of two distributions. 
However, when we build the convolution of two shifted distributions this leads to 
confusion as S(t — a) * T (t — a) does not represent the distribution S « T shifted by 
a. To give a precise meaning to such expressions we introduce the shifting operator 
defined by 


(TaT, O(t)) := (T(t), pt +a). (3.10) 


With it we fix the following notation 


(Sx T)(t — a) := Ta(S x T) (3.11) 
S(t—a)*x T(t — b) := TS * TT. (3.12) 


The convolution product has several useful properties. The first one that we want 
to discuss is distributivity. If all appearing convolutions are well defined, then 
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((S + T) « U, 6) = (S(t) + T(t) @ UA), b(t +A)) 
= (S(t) + T(t), (UC), (t +A))) 
= (S(t), (UA), p(t +A))) + 

(T(t), (UQ), b(t +))) 
= (Sx U, ġ)+ (T * U, >) 
= ( 


S*U+T *«U,}¢) (3.13) 
and similarly 
S*x(T+U)=S*T+S«U. (3.14) 


Further, differentiation of a convolution product is equivalent to differentiation of 
one of the products 


(Di(S x T), 6) = —(S * T, Di¢) 
= —(S(t) @T(A), Drip (t +A)) 

—(S(t), (TO), DriG(t +A))) 

= (Di S(t), (T), (t + A))) 

= ((D;S) * T, ¢) 


where Dz, is the partial differential operator with respect to the ith component of the 
variable t. Since D} i(t +A) = Dy i(t + A) differentiation can also be moved 
to the second factor so that 


D(S * T) = (D;S) * T = S x (DT). (3.15) 


In a similar way one shows that the operation of shifting a convolution product 
can also be moved to one of the factors 


(S « T)(t — a) = S(t — a) * T(t) = S(t) X T(t — 2). (3.16) 


Example 3.4: Convolution with ô 


Consider two Dirac pulses and an arbitrary distribution T in D’(R"). By the shifting 
property of convolution we have 


6(t — a) *6(t — b) = (t —a — b) (3.17) 
T(t) *d(t —a) = T(t—a). (3.18) 
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The convolution of a distribution T with an indefinitely differentiable function y 
is an indefinitely differentiable function. For, by keeping in mind that @ has compact 
support and therefore, as a distribution can be uniquely extended to functions in &, 
we have 


(T * y, $) = (T(t), (vO), P(E + A))) 
= (T(t), (YA = T), @A)) 
= (T(t), (PA), YA — T))) 
= ($), (T(t), yA = T))) 


Then, by arguments similar to the ones that led to the definition of the tensor product, 
one deduces that the inner distribution is an indefinitely differentiable function that 
we call ¢. We can therefore proceed further 


(T * y, $) = (PA), FA) 
= (¢, $) 


and obtain as claimed that T x y = ¢. 

The convolution product is a continuous operation in the following sense. If T is 
a fixed convolution, (Sm)men a sequence of distributions converging in D’ to S and 
all involved convolutions are well defined, then 


lim (Sm * T, 0) = lim (Sn (T), (TA), PE +) 


(S(t), (TA), (t +A))) 
= (SxT,¢ġ) 


or 
lim S,*«T=S*«T. (3.19) 
m—> oo 


In particular we saw in Example 2.5 that ô can be represented as the limit of a 
sequence of test functions 6m and in Example 3.3 that ô is a unit of convolution. 
With continuity of convolutions we therefore deduce that each distribution is the 
limit of a sequence of indefinitely differentiable functions of the form T * @ with o 
a test function. We saw an instance of this in Example 2.2. 

The last property that we want to discuss in this section is associativity. In general 
the convolution of three or more distributions is not associative as is easily verified 
with simple examples. 


Example 3.5: Convolution may not be Associative 


Let’s denote by | and 0 the constant functions evaluating to one and zero, respectively. 
Then 
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Lx (6) #14) =1*6=1 
(1* 6) *1, =0*1, =0 
(6 *1,)*e1l=d*1l=1 
6) x (1, * 1) = undefined . 


We can guarantee associativity by imposing a restriction similar to the one for 
the existence of the convolution of two distributions. Let’s write the convolution of 
three distributions in terms of the tensor product 


(Sx T *U,) = (S(t) @T(A) @ Ulk), (THA +K). (3.20) 


If the intersection of the support of S(t) ® T(A) ® U (k) and the support of (t + 
à + K) is bounded, then, by the properties of the tensor product, the convolution is 
guaranteed to be associative. It’s easily verified that the following is a sufficient con- 
dition: if all, but possibly one distribution have compact support, then the convolution 
product is associative. 


Example 3.6: One-Sided Distributions 


Consider three distributions S(t), T (A) and U(x) in Din Then 7, A and x are > 0. If 
the value of t + 4 + « is bounded then there is a constant c for which t +À +x <c. 
It follows that t is bounded by t < c — (A + «) and similarly for the other variables. 
The convolution of distributions in D’, is therefore always associative. This is also 
true for distributions in D’r and D’;. 

In addition, it’s easily seen that a. is closed under convolution. That is, a con- 
volution between distributions in D’, results in another distribution in D’. 


The discussed properties of the convolution product are summarized in Table 3.1. 


3.3 Approximation of Distributions 


In this section we show how the convolution product can be used to obtain approxi- 
mations of arbitrary distributions. 

We saw that if T is a distribution in D’ and ¢ is a test function in D then the 
convolution product T x ¢ is an indefinitely differentiable function. Its support is not 
necessarily bounded. However, let œ be the test function defined by (2.11) and set 
Q(T) = a(t/m). Then for every m € N the product œm - (T * $) is an indefinitely 
differentiable function with compact support and hence a test function. 
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Table 3.1 Properties of the convolution product 


S*T = T*S 


S*(T+U) = SxT+SxU 


(SxT)xU = Sx(T xU) 


Dk(S*T) = (D‘S)*T = Sx (D*T) 


(S*T)(t-—a) = S(t —a)* T(t) = S(t) * T(t —a) 


limp—soo Sn *T = S**T 


T(t) *6(t —a) = T(t—a) 


Tx D'S’ = D*‘T 


Let (6,,) be a sequence of test functions converging to the 6 distribution. Then, 
with the continuity of convolution, we see that in D’ 


lim a, -(T * Bn) =T. (3.21) 


This shows that every distribution in D' is the limit of a sequence of test functions in 
D. In other words, D is a dense sub-vector space of D'. Every distribution can thus 
be approximated to an arbitrary accuracy by a test function in D. 

Next we construct another dense sub-vector space of D’. For simplicity we only 
treat the one dimensional case and for brevity we write Km for a, -(T * Bm). AS we 
just discussed «n is a test function for every m € N. Let ¢ be another arbitrary test 
function. Then, for every m, we can find constants a and b such that the interval 
[a, b] includes both, the support of Km as well as the one of @¢. If we construct the 
finite sum of 6 distributions weighted by k,, 


b-a% b-a b-a 
Sam = oF i ——)b(t-a-j 
= 2 (a+ j——)st -a—- j—_) 
and apply it to we obtain 
b-a č b—a b-a 
(Sam: 9) = —— Y km(a + j —)b(@ + j—). 
n n n 


In the limit as n tends to infinity we obtain 


b 


e E E T BORDA: 


n> œ 
a 


By the choice of the interval [a, b] we can extend it to the whole of R without 
changing the value of the integral. Hence, by letting m tend to infinity we finally 


3.4 Convolution of Periodic Distributions 49 


obtain 


b 
Him, f OAO de = lim (kn. ) = (T, 0). 


We thus see that every distribution T € D’ is the limit of a finite sum of weighted 
Dirac pulses S, := Syn. That is, finite sums of weighted ô distributions form a dense 
sub-vector space of D'. 

Note that a regular spacing between the 6 distributions is not necessary and was 
chosen purely for convenience. In general any distribution can be approximated by 
a finite sum of the following form 


Tr = X anj 5 — T,j) (3.22) 
j=l 
with a, ; € Cand m, j E€ R. 


3.4 Convolution of Periodic Distributions 


In this section we investigate periodic distributions and their convolution. One way to 
define periodic distributions is to define them in a similar way as periodic functions. 


Definition 3.3 (Periodic distribution I) A periodic distribution T is a distribution 
for which there exist a positive number F such that for all test functions ¢ 


(T(t), 6(t)) = (T(t +T), $). (3.23) 


The smallest such number 7 is called the fundamental period of the distribution. 


Periodic distributions have unbounded support. For this reason the convolution of 
two periodic distributions as defined by (3.6) does not exist. By exploiting their peri- 
odicity it is however possible to find an alternative definition for periodic distributions 
that allows for a well defined convolution product. 

Consider a regular distribution arising from aT -periodic function f. By exploiting 
its periodicity we find that 
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(Fp) = J f(t) p(t) dt 
5 pe a+(m+1)T 
> f fo wdr 
m=- ot mT 
att RA 
= i fi) So o@-—mT)at 
aiT 


= / f(t) ®(t) dt 


with a a constant, 


(t) = 5 o(t —mT) (3.24) 


m>=—CO 


and where the exchange of summation and integration is justified by the fact that 
for every value of ¢ the sum is finite. The function ® is 7 -periodic and indefinitely 
differentiable. 

By introducing the identity 


fMO= ft) GER (3.25) 


with [t] the equivalence class of real numbers modulo 7 , we effectively and uniquely 
define a function f°. By writing [t] as T / (27 )[p] and noting that [g] is an equivalence 
class modulo 27, we can think of f° as a function defined on a circle of radius 
T /(27) at the origin of a plane, with [g] the polar angle. With this interpretation, 
the equivalence class [t] is seen to represent the distance along the arc of the circle T 
from the reference [0]. In the following, to simplify notation, we are going to write 
a representative for an equivalence class. 

Conversely, given a function f°, the identity (3.25) uniquely defines a periodic 
function f (see Fig.3.3). The last integral above is therefore identical to the integral 
of f° $° on the circle T 


at+T 


I FODA at = f F(p) ©°(p) dp. 
T 


a 


We have thus obtained that to every regular periodic distribution f there corre- 
sponds a continuous linear functional f° on indefinitely differentiable functions ®° 
on the circle T. The set of all the latter functions is denoted by D(T). This space 
is isomorphic to the vector sub-space of & consisting of all indefinitely differen- 
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Fig. 3.3 Periodic function 
versus function on T f (t ) 


Fp) 


tiable 7 -periodic functions ® and from which it inherits the following definition of 
convergence. 


Definition 3.4 (Convergence in D(T)) A sequence of functions ®°, € D(T) is said 
to converge to ®° € D(T) if, for every natural number k, the functions D*®?, con- 
verge uniformly to Dk &°, 


In Sect.3.2 we saw that every distribution T can be generated as the limit of a 
sequence of regular distributions fm. Applying this result 


(T, p) = lim (fn, 6) = lim / fà (P) ®°(p) dp = (T°, $°) 
T 


we see that not only regular, but every periodic distribution can be equivalently 
represented by a continuous, linear functional on D(T). 

To see the converse, that is that every continuous, linear functional on D(T) rep- 
resents a distribution on D(R), we have to show that every indefinitely differentiable 
T -periodic function ® can be generated by some test function ¢ as in (3.24). To this 
end we introduce the so-called unitary functions. These are test functions for which 
there is a number F such that 


X &(@- mT) = 1. (3.26) 


Note that, here again, the sum is finite for every bounded range of t. We can find 
several such functions. The following example satisfies (3.26) for7 = | (see Fig. 3.4) 


1 e 1 -1 
emo dt edt |t| <1 
saadi ! j 
0 


lt} > 1 
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Fig. 3.4 Example unitary 
test function 1.27 


0.8 F 


With it we can construct unitary functions for arbitrary periods 7 by r(t) = 
§i(t/T). 


Now, given any 7 -periodic test function ® and an unitary function £y we have 


O(1) = O(1) D> rt- mT) 


m=—CO 


= Do &(t—mT) t- mT). (3.27) 


m=— 00 


Since &7 is a test function, so is & ®. We have thus established that every indefinitely 
differentiable periodic function ® can be represented as the sum of a test function 
p. We conclude that periodic distributions are in one-to-one correspondence with 
continuous, linear functionals on D(T). It is therefore natural to call these functionals 
distributions on T. They form a vector space that is denoted by D' (T). 

We now have a second way to define periodic distributions. 


Definition 3.5 (Periodic distribution II) A periodic distribution T is defined by 
(T, $) := (T°, $°) (3.28) 


with T° a distribution in D' (T) and where ¢ and ©° are related by Eqs. (3.24) and 
(3.25). 


This definition is compatible with the first one since replacing ¢(t) by pt +T) 
doesn’t change ®°. 


Example 3.7: Dirac comb ôy 
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Consider the distribution in D' (T) defined by a Dirac pulse 5° with support consisting 
of the point at arc-length p = 0. Its value on a test function in D(T) is 


(5°(p), ®°(p)) = B°(0). 


The corresponding periodic distribution in D' (R) is 


CO 
ôr (t) = 5 b(t +mT) 
m=—OoO 
and evaluates to the same value as 6° 
CO CO 
(x, JO C+M = $ 6 +mT), Ert) (0) 
m=—Co m=—OoO 


= D> &(-mT) &(-mT) 


m=— 00 


DO) X` ér(-mT) 


= ° (0). 


Since the support of distributions in D' (T) is bounded, with the second definition, 
the convolution of periodic distributions is always well-defined and associative 


(S* T,@) = (S(t) @ TA), p(t +A)) 
= (S° (T), (T? (A), P(t +A))) 
= ($° x T°, ®). 


In addition it’s easily verified by replacing ¢ (t) by (t +T) that the resulting dis- 
tribution is also J -periodic. In other words, D' (T) is closed under convolution. 
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Chapter 4 A) 
Fourier Transform of Distributions tect 


The Fourier transform is a major tool in the analysis of signals and systems. We will 
see that its extension to distributions will make the derivation of many results simpler 
and more direct than when working with functions. 


4.1 Test Functions of Fast Descent 


Consider a Lebesgue integrable function f. Its Fourier transform is defined by 
00 
F{f\(@) = fo) := I fOe” dt (4.1) 
—00 
which is a continuous function of w. If f is such that F is also integrable, then 
00 
FO =F MAO := n f fo e do (4.2) 
—o0 


almost everywhere, with F~ !{ Ô } the inverse Fourier transform. F~ !{ Ô } may differ 
from f at the points where f is not continuous. 
The Fourier transform of the regular distribution f is thus 


J / fOe dt d(@) da 


TOKO) 


—00 —00 
[0.6] [o.e) 
= J f(t) J olw) e / dwdt 
—0o —0o 
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= f SOG) dt. (4.3) 


The last integral looks like a distribution in a form suitable to be generalized to arbi- 
trary distributions. However, the support of ¢ is not compact. This is a manifestation 
of the uncertainty principle of the Fourier transform and can readily be seen by 


$ b oo m 
low) = f omemar= | X H ona 


—a 


œœ on b 
zy = / g(t) 1” dt (4.4) 
m=0 ` A 


with a and b constants such that the interval [a, b] includes the support of ¢. 

To obtain a definition of the Fourier transform suitable for arbitrary distributions 
we have to replace the space of test functions D with a space closed under Fourier 
transformation. Suitable characteristics for the functions in this space can be inferred 
from the above expression for ĝ. First, given the uncertainty principle, the space has 
to be extended to functions of unbounded support (and therefore, the last step, the 
exchange of summation and integration, may not be valid). Then, if all summands 
have to remain finite, the limits lim,;.+.. ¢ (t) t” have to converge to zero for all 
values of m. Finally, to preserve arbitrary differentiability, the above characteristics 
must be satisfied by all derivatives of ¢. These are the characteristics of the so-called 
Schwartz space S of which we give the general definition. 


Definition 4.1 (Schwartz space S(R")) The Schwartz space S(R”) is the vector 
space of indefinitely differentiable functions ¢ : R” — C that, together with all their 
derivatives, decrease more rapidly than any power of 1/|t| as |t| — oo. That is, 
for any n-tuples m, k € N” and t € R” 


lim |r” D'g(t)| =0. (4.5) 
|t| +00 


Functions ¢@ in the Schwartz space are called test functions of rapid descent, or 
Schwartz functions. 

To see that the Fourier transform of a function @ € S(R) is indeed another function 
in the same space, consider the kth derivative of ĝ. By integrating by parts we find 
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D‘¢(w) = / (CJD p(t) dt 


aus e/"D[(—st)' b(n) dt 
Jo 


and by iterating m times 


Go" Rw| =| | D” [EJO] at 


IA 


D” [Ät @@)]| dt. 


Since this is valid for arbitrary k and m it shows that È is in fact a function in the 
Schwartz space. In addition, given that the Fourier transform, and its inverse are 
almost symmetric, a similar calculation shows that the inverse Fourier transform of a 
Schwartz function b is a function ¢ € S. That is, the Fourier transform is a bijection 
from the space S into itself. 


Example 4.1: Gauss Function 


An important example of a function of rapid descent is the Gauss function 


1 
V 210 


It’s widely known that its Fourier transform is 


e7 / 20°). 


p(t) = 


Glo) =P"? 


One of the defining characteristics of distributions is their continuity. To talk about 
continuity we introduce a convergence principle (topology) similar to the ones we 
defined for D and &. 


Definition 4.2 (Convergence in S(R")) A sequence of functions dm € SCR”) is said 
to converge in S(R") to a function @ € S(R”), if for each n-tuples k, p € N” and 
t € R” the sequence |t|? D‘¢,,(t) converges uniformly to |t|? D*¢ (T), that is if 


lim |t|? Dédm(t) = |t|? D*$(1). 
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4.2 Fourier Transform of Tempered Distributions 


For (4.3) to be an expression suitable for the definition of the Fourier transform for 
an arbitrary distribution, we must verify its linearity and continuity. The former is 
clear. To show the latter we have to verify that, if a sequence of test functions m € S 
converges to zero, so does the sequence of their Fourier transforms Êm. That this is 
the case is shown by the following upper bound 


eal: = | 1 n) dt < f AEE / Ibn (t)| at 
Itl<l 


\t]21 


2 
<2suplon(o+ f 2% ar 
|t|=1 


|t|<l 


1 
<2suplom(O sup OP di 
nes z1 lt“ 


Itl<1 
= 2 sup [óm (t)| + 2 sup [Øn (0) tl. 
Įt]<1 21 
We have a good candidate for the definition of the Fourier transform for an arbitrary 


distribution. However, since the space S is larger than D, the Fourier transform can 
only be defined for the following subset of distributions. 


Definition 4.3 (Tempered distributions) Tempered distributions (also called distri- 
butions of slow growth) are distributions that can be extended to continuous, linear 
functionals on the Schwartz space S. 


The space of all continuous, linear functionals on S is denoted by S’ and, since the 
Schwartz space S is a subspace of &, we have the following inclusion: &' C S’ C D. 
Consequently, from Sect. 2.5, we conclude that, if a distribution T € D’ can be 
extended to a continuous, linear functional on S, then this extension is unique (and 
the other way around). S’ can therefore be identified with tempered distributions. 

Example 4.2: Slowly Increasing Function 
Consider a locally integrable function f satisfying 
IFO < Ct!" as |t| > co 


for some constant C and some natural number m. Then, f is a tempered distribu- 
tion, since 


KED) zi IOLO + f fod 


It] 


c 
< sup lOl f  If®Oldt + sup (tocol) f © at 
It}<1 |t}<1 |t|=1 t21 |t| 


is bounded for every ¢ € S. 
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Example 4.3: Distributions in &’ 


Distribution with bounded support are defined on all indefinitely differentiable func- 
tion, independently of their asymptotic behavior as t — oo. For this reason the 
Fourier transform of distributions in &’ is always well-defined. 


Example 4.4: Multiplication with Polynomial 


If T is atempered distribution and p a polynomial, then p T is a tempered distribution. 
pT is in fact defined as 


(pT, o) =(T, pd) 


and it’s easy to see that pd € S. 


We can finally define the Fourier transform for tempered distributions. 


Definition 4.4 (Fourier transform on S’) The Fourier transform of a tempered dis- 
tribution T and its inverse, are defined by 


(F{T}, b) := (T, F {¢ġ)}) (4.6) 
(FHT), p) := (T, F~{o}) (4.7) 
for every function @ € S. 


Clearly, the Fourier transform of a tempered distribution is a tempered distribution. 
Note that, given the properties of Schwartz functions, for a tempered distribution it’s 
always the case that 


FUE PSAP Tanta 


In addition the Fourier transform and its inverse satisfy the following symmetry 
relation 


(F {T},¢) -rron {ro f acoema) 


= (T (w), 21 F ~'{6}(—o)) 
= (2n F7 HT (—0)}, $). 
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If in this expression we replace T by its Fourier transform and denote it by T, then 
this symmetry relation can also be expressed as 


F [PO] = 27 T0). (4.8) 


As with functions, we will often use the convention of denoting by T the Fourier 
transform of a tempered distribution T. 


Example 4.5: Fourier Transform and ô 


The Fourier transform of the delta distribution 6 is 


(3.0) =(5,6)=6@ = f (Ddi = (1,4) 


or 


o> 
II 
= 


Conversely, the Fourier transform of the constant function 1 is 


(1.4) =(1.4)= f odo = 20 (5, F) = 20 84) 


or 
Î = 278. 


This expression is often found in the technical literature symbolically written as 


le lf 
O= zy fer da= = f ede. 
—0o —oo 


The Fourier transform of the derivative of ô is 
(F (D8}, 6) = (D8, $) = - (3, - 08) = jo(3,4) = Uo, g) 
and, by iterating this procedure, for the higher order derivatives we find 


F {D18}; = Go)". 
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Example 4.6: Complex Tones 
The Fourier transform of a complex tone is 


(oe) 


Feela =(e™, 6) = I el b(t) dt = 27 p (0) 


=2r (ê (w — a), $) 
or 
F {el} = 27 (w — we). 
Similarly, the Fourier transform of a shifted Dirac pulse is found to be 


F {8(t —t))} =e, 


Example 4.7: Dirac comb 


An equally spaced sequence of Dirac pulses is a tempered distribution called a Dirac 
comb with period T 


[0,0] 


srt) := X` êt- mT). 


m=—OCO 


The linearity and continuity of distributions permit to calculate its Fourier transform 
term by term and, by using previous results, we obtain 


loo) 


F {êr} = 5 elomt 


m=— 00 
This distribution is formally the limit 


im, (sp K (Œ) + sv,m(@) — l, o(@)) 


with 


K-1 


Sp,K(@) —— 5 font 


m=0 
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M-1 


Sv.m(@) = 5 eter 


m=0 


For values of w Æ k 22/7 ,k € Z the partial sums can be represented by 


t= eloKT 1 eJeKT 
SP.K(@) = 1 — e/oT E 1 — eet 1 — e/eT 

_ e JOMT 1 e JOMT 
Sum (@) = l—e se J —e eT | — evor" 


The sum of the first terms is easily seen to equal | and, with the results of 
Example 2.11, the limit of the second ones do vanish. The support of F {ôy} there- 
fore consists in the set of points w = k 2r /T ,k € Z. Consequently, when applied 
to any test function ¢ € S, its value must be a weighted sum of the values of the 
test function at these points. Since, replacing ¢ (t) by @(t + T) doesn’t change the 
result, we can also deduce that the weighting factor must be the same for all terms. 
We thus have 


(F iôr}, p) = D> Como.) = C (8v P) 


m=— 00 


with C a constant and w, = 2x/T. The value of the constant can be found by 
inserting any Schwartz function. A convenient choice is the one of Example 4.1 with 
o =Ẹs/2r/T. With it, on the one hand we have 


(oe) 


J 2 I 2 
F {6 = Sw, Ja (tT) / (40) = X m'n 
i r i Á c{ i On , 20 noe 


and on the other hand 
oe) 
(F {ôr}, p) = (5, ô) = (sr, ener) E >» enn 
m=— 00o 


so that C = 27x/T. We have thus established the following important result 


F {8T} = We Sx, - (4.9) 


The Fourier transforms of the ô and related distributions are summarized in 
Table 4.1. 

A useful property of the Fourier transform is that it preserves parity. This means 
that the Fourier transform of an odd tempered distribution T is odd 
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Table 4.1 Fourier FIT} 
transformation of the ô and 
related distributions 1 
1 276 
Dk5 Go) 
(—yt)* 2n D*8 
Ô(t — Te) e JOT 
e/ et 21 5(@ — wec) 
ôr a bn /T 


(F iT}, pCt) = (T, F i(t) = 


(r fsc near) 
=(r _ [ ocerar) 
T, 


=(7,6(-«)) = - (7,4) 
=- (F (T), 6) 


and, similarly, the Fourier transform of an even tempered distribution is even. 

We conclude this section with an important property of the Fourier transform of 
real distributions. Let T be a real distribution, ¢ a real valued Schwartz function and 
let denote complex conjugation by an over bar. Then 


or . 
T(@) = T(-@) (4.10) 


as for real functions. 


4.3 Distributions with Bounded Support 


The Fourier transform of distributions with bounded support can be expressed in a 
simpler, useful form that we explore in this section. To this end, consider first the 
convolution between a distribution of bounded support T and the regular constant 
distribution 1 
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(sT, $) = T ag=(To, [ oe + ar] 
= (1, (7, 4)) = | Toge +9) dd 


and note the two equivalent integral representations. With these equalities we can 
then proceed to represent the Fourier transform of T by 


(7,4) = (ro. Í poe” do) 
= | TO, we") do 
= [ (70.61) (@) de 
= re. e™®), b()) 


with . 
Tw) = (TE), e) (4.11) 


an indefinitely differentiable function of slow growth. In a similar way we obtain 


F-NT}= (To) e) (4.12) 


4.4 Fourier Transform and Convolution 


Consider a tempered distribution S and a distribution with bounded support T. Their 
convolution is well-defined if, for every test function ġ € S 


(Sx T, p) = (S(t), (TA), pí +A))) = (TA), (S(t), (+ A))) - 
In the first case, 
(T(A), b(t +A)) 


is a function ¢(t) € Sand the outer functional is therefore well-defined. In the second 
case, 


(SQA), b(t + A)) 


is an indefinitely differentiable function y(t) € &. Consequently, the outer functional 
is again well-defined. Equality of the two expressions is guaranteed by the uniqueness 
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Table 4.2 Properties of the T 


Fourier transformation Fm z 
Ep an Tn Ep am În 
SxT ST 
ST LS «T 
T(t—T) Te Jet 
Tel®ct T (w — we) 
D‘T Go T 
(—ytk T D‘T 
T 2n T(—w) 


of the extension of the distributions T and S to &' and S’ respectively (and the other 
way around). 

We now show that the Fourier transform of the convolution of S and T is the 
product of their Fourier transforms § and 7: 


(F {ST}, p) = (S*T,F {p} = (S, (T, F {})) 


( 
=(sw,(70, | goe ar)) 
R 
= (so. f p(t) (TA), e) ee ar) 
R 


= (F {S(@)}, 6 (TA), e) 
(F {S@)} (TA), e™), eo) 


or ii 
F{S«T}=8T. (4.13) 


The product is well-defined since T isan indefinitely differentiable function of slow 
growth. A similar result is readily obtained for the inverse Fourier transform 


F'US*T} =2r F HSF HT}. (4.14) 


These are central results and arguably the most important properties of the Fourier 
transformation. It can be shown that this relation is valid in other cases as well. For 
example, in the case of locally integrable functions which are slowly increasing [20]. 

With these properties, the previously obtained Fourier transforms for the Dirac ô 
distribution and the properties of the convolution product we immediately obtain the 
properties listed in Table 4.2. In particular, it’s noteworthy the fact that the Dirac 6 
distribution acting as a unit with respect to the convolution product is related to the 
fact that its Fourier transform is 1. 
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Example 4.8: Fourier Transform of pv 1/t 


In Example 2.13 we saw that the equation 
tT=1 


has solutions 
1 
T = pv 7 + Cô. 


with C a constant. By noting that pv 1/t is odd, while ô is even, we can find the 
Fourier transform of the former. First observe that 


F {(—jt)T} = DT 
and hence, by transforming both sides of the equation we have that 
| DT =2n 68. 


Since the Fourier transform preserves parity, we have to look for an odd solution of 
this equation, and we find 


J 


T(w) =-ymsign(w) or F [pv L| (w) = sign (w). 
wt 


With this result and the symmetry of the Fourier transform (Eq. (4.8)) we also find 


F {sign} (w) = 27 pv u = pv—. 
m (—w) Jo 


Example 4.9: Fourier transform of 1, 


The Heaviside step function 1, can be written as 


1 
1.@= 5 [1 + sign(t) | : 


Its Fourier transform is therefore 
1 2 1 
F {14H} = = | 27 6(~) + pv — | = x ô + py —. 
2 JO Jo 


From the symmetry of the Fourier transform we also obtain 
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F{x5+pv—} = 27 1,(-a) 
J 


or 


4.5 Periodic Distributions 


In this section we investigate the Fourier transform of periodic distributions. Consider 
first a regular distribution arising from a locally integrable periodic function f. If we 
introduce a function fn 


fQHa<t<a4+T 
otherwise 


HA) = k 
with a a constant, then f can be expressed as a convolution product 


fO = HE) * êr. (4.15) 


The Fourier transform of f can therefore be written as the product of the transforms 
of fn and 67, which is well-defined since f, has compact support 

In © 
F | fn, 677") 8(@ — ma). 


m=— O0 


FAF} = (fa, e”) odo, = 


From this we see that the Fourier transform of f consists of a train of equally spaced 
Dirac pulses, each weighted by a numerical coefficient, and that this set of weighting 
numbers fully characterize it. 

If we now represent f as the inverse Fourier transform of F { f} and make use of 
the results of Example 4.6, we obtain a trigonometric series 


fO = TÈ fn, "| FTH — me)} 
= dif fa, e772) emee, (4.16) 


I aas 


called the Fourier series of f. The coefficients are values obtained by evaluating fn 
on indefinitely differentiable periodic functions which are members of D(T) and the 
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values are identical to the ones obtained by evaluating the distribution f° € D'(T) 
corresponding to f (see Sect. 3.4) on the same functions. Consequently, the above 
trigonometric series is both a representation of a periodic distribution in D' (R) as 
well as that of a distribution in D’(T). 

These arguments can be extended to general periodic distributions without any 
difficulty so that we have the following general definition of the Fourier series of a 
periodic distribution. 


Definition 4.5 (Fourier Series) The Fourier series of a distribution T° € D' (T), or 
a periodic distribution T € D' (R), is the trigonometric series 


CO 
x Cpe” 2P (4.17) 
m=— 00 
with coefficients 
Cn = 1 (Tenta). (4.18) 
T 


The coefficients are called the Fourier coefficients of the series. 


The Fourier series is the only trigonometric series that converges to the distribution 
T° in D' (T). In fact, if for any ® € D(T) the series 


(oe) 


> din (elmer, D) 


m>=—-CO 


does converge, then by putting ® = e /””-? and using the orthogonality of trigono- 
metric functions we find that 


(T°, eme) = Tdp 


which shows that the coefficients dm correspond to the Fourier coefficients of T°. 

As every distribution, the Fourier series of a distribution can be differentiated 
term by term. Therefore, if we designate by c,,(T°) the mth Fourier coefficient of 
the distribution T°, we have that 


Cm(D‘T°) = (mae) Cm (T°). (4.19) 
A natural question to ask is: How do we know if a certain trigonometric series 


converges to a periodic distribution? To answer this question first note that the series 
of numbers 
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is absolutely convergent. Therefore, if the magnitude of the coefficients |c,,|, as 
m —> œ, are bounded above by C/|m|*, with C a constant, then the series converges 
to a continuous function f and hence to a distribution. But distributions are always 
differentiable term by term an arbitrary number of times. Using (4.19) we therefore 
conclude that, if the magnitude of the coefficients of the series, as m — oo are bound 
by C|m|* for some number k > 0 and a constant C, then the series converges to a 
distribution. 

We derived the Fourier series starting from the Fourier transform and its property 
that converts convolution into a product. We therefore expect a similar property for 
the Fourier series. Consider the convolution of two distributions S° and T° with the 
same period 7 . The Fourier coefficients of the resulting series are 


Cn (S° * T°) = 2 (S° * T°, en smeet 


= = (S°(t) Q T° (à), emme) 
= = (s° (t), eo met (T°(2), en smo.n) 
= T Cm(S°) Cm(T°) ; Go 


Consequently the Fourier series of the convolution of S° and T° is 


CO 
Sox To =T 5 Cm (S°) Cm (T° ert (4.21) 


m=— 00 


and, indeed we see that the Fourier series representation of periodic distributions 
transforms convolutions into products. 


Example 4.10: Fourier series of ôy 


The mth Fourier coefficient of the Dirac comb ôy is 


1 1 

m(ôr) = = (d°, e Imect\ — — 

enl6r) = =| Jaz 
with we = 27/7. Hence, its Fourier series is 


[0,0] 


ôr = a Z emos . 


m=— 0 


If we now compute the convolution of 5; with another 7 -periodic distribution T 
with Fourier coefficients c,,(T), from (4.21) we see that, as expected, ôy act as a unit 


Cn (T x ôr) =T Cn (ôr) Cn(T) = Cn(T) . 
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A periodic distribution can be represented as a convolution product between the 


Dirac comb 67 and a distribution different from the one of (4.15). For example, with 
&r any unitary function we have 


T =T Ý ér(t—mT) 


= Do T@-mT)r(t-mT) 
=: $ S@-mT)=Sxôr (4.22) 


which defines a distribution S whose support is finite and larger than a single period 
of T. Using this representation we can express the Fourier coefficients and the Fourier 
transform of T in terms of the one of S as 


f iio s §(w) 8( ) (4.23) 
= We ‘eo = SS w w — MWe . 
T m=—0o 3 
Smo.) 
m(T) = : 4.24 
CalT) F (4.24) 


For this reason, if in some calculation we obtain the Fourier transform of a signal in 
this form, with Š the transform of a known non-periodic distribution, then we can 
immediately write T in terms of S as in (4.22). 

We close this section with a property that is the counterpart of (4.10) for the 
Fourier coefficients of a real periodic distribution 


Tm = Cm. (4.25) 


4.6 Extension to Several Variables 


The Fourier transform can be extended to functions of several variables by trans- 
forming each variable individually. That is, if f is an integrable function on R”, 
then we can apply the one-dimensional Fourier transform to each variable individ- 
ually, keeping the other ones constant. After performing this operation with respect 
to each variable in turns, we obtain the following expression which defines of the 
n-dimensional Fourier transform 
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f(@1,...,@n) = F {f} (@1,..-5 On) 


[0,6] CO 
= f cee | F(t oe TIC T OTE m dt... dtn. 
== — 


To shorten the notation we will write 
f@)=F{f}@)= / fee d'r (4.26) 
R” 


with t, œ € R” and 


n 
(w, T) = So Omtn : 


m=1 


The n-dimensional inverse Fourier transform can be derived with the same procedure, 
and we obtain the following definition 


1 
FHF) := aF [ f@edl@ da. (4.27) 


With these definitions it’s easy to see that our definition of Fourier transform for 
tempered distributions remains valid for n > 1 as well. All properties carry over in 
similar form. For example, looking back at the derivation of the symmetry relation 
given by (4.8), we see that in the n-dimensional case it becomes 


F [2o] = Ox)" T(—0). (4.28) 


The only difference from the one dimensional case is the fact that the factor of 27 
becomes (27r)”. This happens to all properties involving factors of 277. 

The most important convolution property (4.13) remains unchanged, as can easily 
be verified by inspecting the derivation for the one-dimensional case. 

Before proceeding, it’s convenient to extend the multi-index notation that up to 
now we only used in conjunction with the differential operator. Let a be an n-tuple in 
C” and k a multi-index that we allow to include negative numbers (k1, .. . , kn) € Z”. 
Then we can define 

k kı k, 


a* := aj ...a" (exponentiation) 


ka := (kiai, ..., kan) (direct product) 
yy fe = YD... YS fig.....k, (Summation) 
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with fg some function parameterized by the multi-index k and l, u lower resp. upper 
multi-indices. If in a summation we write integer numbers instead of / and u, we 
intend multi-indices equal to that number in every position. 

We can introduce a multi-index notation for the factorial as well. However, this 
only makes sense for tuples of natural numbers k € N” 


k! := Tan.. 
i=l 


Example 4.11: Fourier transform of ô 


In Example 3.1 we saw that the 6 distribution in D'(R”) is the tensor product of one 
dimensional 5’s. Hence, with t, A € IR” and the results of Example 4.5 the Fourier 
transform of the n-dimensional shifted ô becomes 


F (8(t — A) =e 
and it’s partial derivative 
F {D6(t)} = (Jæ) i=l,...,n. 


Using the multi-index notation, the higher order partial derivatives can be conve- 
niently expressed as 


F {D*5(t)} = Uo. 


Using the n-dimensional symmetry relation given by (4.28) we also immediately 
find 


F fe 07] — (27)” ô (w = wc) 
and 


F {(—yt)*} = 2r)" D‘5(@). 


As in the one-dimensional case, the other properties of the n-dimensional Fourier 
transform are immediate consequences of the convolution property and the convo- 
lution and transform of the ô distribution. 

A periodic function on R” is a function that is periodic in each independent variable 
individually, that is, such that there are positive numbers 7; fori = 1, ..., n, called 
the period of the ith independent variable, so that 
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FT ises ti Sis res Ta) =J Cis oss Tene n) 


This extension of the concept of a periodic function to higher dimensions permits 
us to widen the definition of periodic distributions (3.23) on test function of higher 
dimensions D(R”) in a straightforward way 


(T(t, ...5T +T5,---5 Tr), O (Threig Ti ++ Tr) 
= (T(T,...,Ti,---, Tr), P(T1, +5 Ti, +s Tr)) a 
From this follows without any difficulty an extension of the second Definition 3.5 as 


well. 
The n-dimensional Fourier series of a periodic distribution T € D’(R") is 


CO 
> c(T) e7 20T) 
k=—0o 
with k an n-dimensional multi-index, we the n-tuple (27/Tq,..., 27 /Tn) and c,(T) 
the Fourier coefficients 
d= — ere 
Fperra T, d 


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, 
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate 
credit to the original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s Creative 
Commons license, unless indicated otherwise in a credit line to the material. If material is not 
included in the chapter’s Creative Commons license and your intended use is not permitted by 
statutory regulation or exceeds the permitted use, you will need to obtain permission directly from 
the copyright holder. 


Chapter 5 A) 
Laplace Transform of Distributions cicit; 


5.1 Definition 


The classic Laplace transform is closely related to the Fourier one and has similar 
properties. In a way it can be seen as a modification of the latter in such a way thatit can 
handle exponentially growing functions. To achieve this the transformable functions 
are required to be bounded on the left.! Concretely, the classic one-dimensional 
Laplace transform is defined on so-called original functions f : R —> C with the 
following properties: 


1. f)=0 for t<0O. 
2. There is areal number op such that f(t)e~°” is absolutely integrable over R. 


The greatest lower bound op satisfying the last property is called the abscissa of 
convergence of f. 

The Laplace transform of an original function f is a function of the complex 
variable s := o + Jæ (0, œw € R) defined by 


F(s) := L{f} (s) =f fOe™dt Ms} > or. (5.1) 
0 


We adopt the common convention of denoting the Laplace transform of a function 
f with the same letter, but capitalized. That is, for this example F = L {f}. 

We want to extend this definition to distributions with support contained in [0, oo), 
that is to right-sided distributions T € D’,. To this end note that the integrand in 
the above definition is the product of an original function f with an indefinitely 
differentiable function with unbounded support. If we multiply the latter by any 
indefinitely differentiable function y (t) with support bounded on the left and equal 
to 1 on a neighborhood of [0, oo), then we obtain the product of two functions 


' There are left-sided and two-sided Laplace transforms as well, but we are not going to discuss 
them. 
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with support bounded on the left without changing the value of the integral. Then, 
since f is an original function, for any 09 > of the product f(t)e7™™ is a (regular) 
tempered distribution and y (t)e~*‘e™’, for R{s} > oo, a test function of fast descent. 
We have thus obtained a way to define the Laplace transform for a restricted class of 
distributions. 


Definition 5.1 (Laplace transformable) A distribution T € D’, is said to be Laplace 
transformable if there exists a constant 09 € R such that 


T(the ™ 


is a distribution in S’(R). The greatest lower bound or is called the abscissa of 
convergence of T. 


Definition 5.2 (Laplace transform) The Laplace transform of a Laplace trans- 
formable distribution T is defined by 


L{T} = (Tne, pe S) for R{s} > 09 > or 


with y any indefinitely differentiable function with support bounded on the left and 
equal to 1 on a neighborhood of the support of T. It is commonly abbreviated by 


(T(t),e") for R{s} >or. 


The right-half plane R{s} > ør is called region of convergence (ROC). 


If T is Laplace transformable, its transform is a well-defined number for any value 
of the complex parameter s with R{s} > ør. In other words, it is a function of s. 
Since s only appears as a parameter of the test function of fast descent y (t)e7™ 670%", 
with the continuity and linearity of distributions it is easy to see that 


D;(T (t), e *) = (T(t), te“) = (-tT (t), e™*) f 


In addition, since e™* is an entire analytic function, in the right half-plane R{s} > or, 
£{T} is a holomorphic function. This is an important result as it allows to use many 
results from complex analysis. 

Higher order derivatives are obtained by iterating the above result so that we have 


D; L{T} (s) = £{(—*T} (8). (5.2) 


Note that the abscissa of convergence of the derivatives is the same as the one of T. 
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Example 5.1: Laplace Transform of ô 


The Laplace transform of ô, of ô(t — a) and of D*6 are 


—Ssa 


(8, e) = 1 
(i(t—a),e “) =e 
iD ie) = (8, (1) De) =o", 


In all cases the region of convergence is the entire complex plane C. 


Example 5.2: Laplace Transform of 1, (t) t* / k! e” 


Let a be a complex number. The Laplace transform of the regular distribution 
1.(t) e” is 


ca 1 
le“, er = f e Sat dt= 
0 S—a 
with abscissa of convergence Oexp(at) = 4. 
From this and (5.2), the Laplace transform of 
tt at 
14) i e 
is readily found to be 
tk (1% , 1 1 
14t) — e} = D‘ = i 
efron | kl s-a (s-a 


5.2 Properties 


The Laplace transform is a linear operation: given two Laplace transformable 
distribution S and T with abscissa of convergence oy resp. or, the transform of 
their weighted sum is 


L{qSteT)=cqL{Ss}+oaLl{T} for R{s} > max(os, or). 


Let a be a complex number. Then, from the definition, the Laplace transform of 
eT ( t) 


£ {eT} (8) = (TO), e) = (T (0), tt = LIT} (s +a) 


78 5 Laplace Transform of Distributions 


with region of convergence R{s} > or — {a}. 

We saw in Example 3.6 that the convolution of distributions in D’, is always 
well-defined. Therefore, the convolution of two Laplace transformable distributions 
S and T is well-defined. Then, for Ji{s} > oo = max (os, or), the transform of their 
convolution product is by definition 


L{S « T}(s) 
— (S * T)e y (the 8-0") 
= (S(t) 8 T (AJETA, yt + A)Je TETA; f 


By noting that, over a neighborhood of the support of S& T, y(t+A)=1= 
y(t)y (A) we can proceed further and obtain 


L£{S* T}(s) 

= (SeT Q TAT, y (te 8-0 y (Ae 7E) 
= (SeT, (TET, yAe E) y (e6) 
= (SeT, y (e7 ET T (e7, y (A)e 767A) 
= L {S} (S)L{T} ($) 


which is well-defined since, in the specified ROC, the Laplace transforms of S and 
T are holomorphic functions. We thus see that, as the Fourier transform, the Laplace 
transform converts convolutions into products 


L{S*x TJ =L{S}L{T} for R{s} > max(os, or). (5.3) 


A key advantage over the Fourier transform is that here the multiplication is between 
functions that are holomorphic in the specified open right-half plane. 

In a similar way as we did for the Fourier transform, we can use this property 
to derive several additional properties in a straightforward way. Specifically, using 
the properties of the convolution product and the Laplace transform of the Dirac 
6 and related distributions (Example 5.1), we immediately obtain the properties in 
Table 5.1 that we have not yet discussed. 


Example 5.3: Convolution of exp’s 


Let k and / be natural numbers and a a complex constant. We want to calculate the 
following convolution product 


t r 
407 e” * 07 e” x 


From the convolution property of the Laplace transform and the results of Exam- 
ple 5.2 the transform of the above convolution product is 
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Table 5.1 Properties of the Laplace transformation 


T L{T} ROC 
Xn Gn Tin Yin Um LAT Ya R{s} > sup,, (Om) 
S«T LIS L{T} Ris} > max(os, or) 
T(t —T) es" L{T} Ris} > or 
eT T(s +a) KR{s} > or — Nfa} 
D‘T sk LIT} Ris} > or 
(COT DEL{T} Rs} > or 

1 1 1 


(s— a) (s-a) — (s —a)EH ` 


With it, we find the desired result as 


k+l 
1 —— e”, 
+(t) +d! e 


5.3 Inverse Laplace Transform 


The Laplace transform isn’t only similar to the Fourier one, it can also be formally 
related to it. Consider first an original function f. By writing its Laplace transform 
as 


f f@e“e!™ dt 
0 


we see that, for every value of o > øp, it can be interpreted as the Fourier transform 
of the function f(t)e~’. 

This relation between the two transforms can be extended to distributions. 
Consider a Laplace transformable distribution T. We have established that, for 
N{s} > or, its Laplace transform is a holomorphic function of s. In addition, by 
definition, for every value oo > or, T(t) e~™ is a distribution of slow growth and 
for o > 00, y(the ©” e- J” is a test function of fast descent for every value of 
w. We conclude that the Laplace transform of T considered as a function of w for 
fixed o must be a regular distribution of slow growth. Hence, the following integral 
is well-defined 


I (T(t), y (e7 e)p (w) dw 
R 


for any @(w) € S. This integral can be recognized as the tensor product 1(w) @ 
T(t)e~™ and, using Fubini’s theorem, it can be rearranged to become 
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(THe ™ yOe em [ero dw) 
= (THe, OO) = (FAT) e}, @@)) . 
We thus obtain the claimed relation between Laplace and Fourier transforms 
LIT} =F{e T} for o >or (5.4) 


which gives us a formal way to invert the Laplace transform. 

A first consequence of this relation is that, given the Laplace transform of a 
distribution T with abscissa of convergence or < 0, we can immediately find its 
Fourier transform by setting s = jw 


T(o) = LIT} Uo). 


Another important consequence of (5.4) is that, if £ {T} = 0 on a vertical line 
with R{s} > or, then T = 0. In fact, with the above result we have that 


0 = (L{T},¢) = (Fle-"'T}, p) = (e~'T, ĝ) 


from which we conclude that e~°‘T and hence T must vanish. In addition, with 
T = S—U this implies that, if £{S} = £{U} on a vertical line of the region of 
convergence, then S = U. In other words, if a function, holomorphic in an open 
right-half plane, is the Laplace transform of a distribution in D’,, then it is the 
transform of a unique distribution. 

The next logical question to ask is: which holomorphic functions are transforms 
of a distribution? To answer this question, consider first an holomorphic function F 
bounded by 


C 
|F(s)| < — for R{s} > o > 0 


Isl? 
with C a constant. Then 
[e6] oo C 
/ |F(oo + jo)e!"| dw < J -3z dw < œ 
68 -œ 06 + @ 
and the inverse Fourier integral 
[e6] 


— F (o0 + Joe” dw 
20 J- 


exists and defines a continuous function that we may write as e7% f (t). The thus 
defined function f is therefore 
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Fig. 5.1 Integration path for i 
t<0 w 
R 
¥ atl 
00 o 
A 
Tı l 
oot poo 
fA = I F (o + jaye!” dw 
2m J_oo 
1 [e] 
= =l F(oo + Jwe CH dw 
20 J- 
1 Oo+JOo 
= — F(s)e" ds (5.5) 
20 J Joo—joo 


and corresponds to the integral of an holomorphic function along the vertical line 
defined by R{s} = oo. If we write the variable s in its polar representation s = Re/? 
it’s easy to verify that in the right-half plane and for t < 0 


C 
|F(s)e"| < RI 
Therefore, if we close the integration path of the above integral by first making the 
line finite, then closing it along the half-circle shown in Fig. 5.1 and then taking the 
limit R — œ, the value of the integral remains unchanged. In fact 


1 
| F(s)e" ds 
27 J T2 


Having closed the integration path we can now use Cauchy’s theorem and conclude 
that for t < 0, f(t) = 0. 

Cauchy’s theorem can also be used to show that the value of the integral is the 
same along any vertical line with R{s} > 0. To show this, we integrate along two 


Rn =0. 


< lim 7 
R>o 27 R 
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vertical segments and close the path with horizontal ones. Since the contribution of 
the horizontal paths vanishes as we extend the length of the vertical ones toward 
infinity, we conclude that the value of the integral along the two vertical lines must 
be the same. 

We have thus established that the function f doesn’t depend on the value of 
do, is continuous and vanishes for t < 0. These characteristics make f an original 
function and hence a Laplace transformable distribution in D’,. Furthermore, from 
the definition of f and (5.4) we see that F is its Laplace transform. 

Now consider the more general function G(s) = s* F(s —og) withog € R, k € 
N and F as before. From the properties of the Laplace transform we know that it 
is the transform of the distribution g = D* (e°! f) which is also clearly in D'_. We 
therefore conclude that every function G that, for some og € R, is holomorphic in 
the open right-half plane t{s} > og and is bounded above by a polynomial P 


IG(s)| < PAs) Ms} > oc (5.6) 


is the Laplace transform of a distribution in D‘,. 

Without going into the details we also mention that the converse is also true. That 
is, every Laplace transformable distribution T € D’, is a derivative of some regular 
distribution associated with a continuous original function [16]. 

With the transforms of Examples 5.1 and 5.2 we can find the inverse Laplace 
transform of any rational function of s by partial fraction expansion. Note also that 
(5.5) corresponds to the classic inverse Laplace transform for functions. 


Example 5.4: Laplace versus Fourier Transform of 1, 


In this example we calculate the Laplace transform of the Heaviside step function 
1,. While it’s easy to obtain it directly from the definition, we calculate it from 
our previous results D1, = 6 (Example 2.8) and £ {Dô} = s (Example 5.1). This 
is to compare it with the methods used in Examples 4.8 and 4.9 to obtain its Fourier 
transform. 
By using 
D1, = Dô x 14 


and the convolution property of the Laplace transform, we obtain the following 
equation for the Laplace transform of 1+ 


sL{14}=1. 


Then, since all Laplace transforms are holomorphic functions in an open right-half 
plane and only the zero distribution has zero as its transform, we can conclude that 


1 
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As an extra step we want to obtain the Fourier transform of 14 from its Laplace 
transform. The abscissa of convergence of £ {14} is not smaller than zero. Therefore, 
we can’t obtain it by simply setting s = jw. However, in cases like this, where the 
abscissa of convergence is zero, given the continuity of distributions, it’s still possible 
to obtain the Fourier transform as a limit, so that 


I $BÌis}) ds 
r S J 


(F {14}, ¢) = lim 


Rs} 10 


with I’ a vertical line in the ROC@. The limit is only problematic around the origin, 
where we can integrate along a small half-circle of radius € 


1 43 
lim =f ASEN jy 
RENO J Jr 


S 
1 n/2 i 
=p | | tO dot f EEA seer av 
e40 J ljal>e J® =n /2 ee/? 
m/2 
=tim | f O aw} + (0) do 
€{0 loj>e JO —1/2 


1 
= (pv — + mô, $). 
Jo 


Example 5.5: Exploiting Continuity 


To simplify the notation we assume that all appearing functions (more correctly, 
regular distributions) disappear for t < 0. For example, we write t% for 14 (t) t*. 

From the results of Example 5.2, by setting a = 0, we can note a dualism between 
positive and negative powers 


tt 1 
Lizi = za Mb) > 0. 


If we sum the first N powers of t we obtain 


Using the continuity of distributions we can let N tend to infinity. The original 
distribution converges to the exponential function, while the transform becomes a 
geometric series (plus a factor) 
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i< vt 
t — — m 
= ie sg D sk 


that converges for |s| > 1. This means that there is a right-half plane R{s} > 1 where 
the series converges and can be summed to obtain the expected result 


= s1l-=l/s s—1 


Lfe} R{s}>1. 


Note that the last expression can also be expressed as a geometric series of positive 
powers of s 


However, this series only converges if |s| < 1. Consequently, there is no right-half 
plane where the series is holomorphic and hence it isn’t the Laplace transform of a 
distribution. 


5.4 Extension to Several Variables 


The Laplace transform can be extended to functions of several variables in the same 
way as we did for the Fourier transform. Let t be an n-tuple in R”. A multi-variable 
original function f : R” — C is a function that is an original function with respect 
to each variable independently, that is 


1. f(t) = Oif any t < 0,i =1,...,2. 
2. There is a oo € R” such that f(t)e~“” is absolutely integrable over R” . 


The classic Laplace transform is then defined as 
LIA) =f fear (6.7) 
R4 


with s € C” and R} the n-dimensional Cartesian product of the half-line [0, oo). 

With this definition we see that the definition of the Laplace transform for distribu- 
tions extends to higher dimensions essentially without modification. By interpreting 
k as a multi-index and all variables as n-dimensional ones, all properties of Table 5.1 
remain valid. The only exception is the classic inverse Laplace transform integral 
(5.5). As we have seen, this integral is based on the inverse Fourier transform and 
the factor 27r J has therefore to be replaced by (27 J)”. 
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Chapter 6 A) 
Summable Distributions E 


In this chapter we study a class of distributions, summable distributions, that can 
be extended on smooth bounded functions. These distributions have properties that 
are well suited to describe some classes of systems. However, the material is more 
technical than the rest of the book and can be skipped without loss of continuity. 


6.1 Definition and Canonical Extension 


One can define summable distributions in several equivalent ways [16]. The following 
one is the most suitable for our purposes. 


Definition 6.1 (Summable distributions) A summable distribution T is a distribution 
that can be represented as a finite sum of derivatives (in the sense of distribution) of 


functions fy € L! 
T = 5 D* fi 


|k|<m 


with k an n-tuple in N” and m € N. 


We denote the vector space of summable distributions by D7... 

An important property of summable distributions is the fact that they can be 
extended to continuous linear functionals on 8, the set of indefinitely differentiable 
functions that, together with all their derivatives, are bounded 


B:= {p €C™ D*@ is bounded, k € N"}. 


As usual, to talk about continuity, we need to define a convergence criterion 
(topology). 
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Definition 6.2 (Convergence in B) A sequence of functions (nj) in B converges 
to zero if the sequence as well as all sequences of the derivatives (D*n j) converge 
uniformly to zero as j tends to infinity. That is, given the norms 


Pan) = >. sup |D*n(z)], 


n 
|k|<m teR 


the sequence (7 ;) converges to zero if, as j tends to infinity, the sequence of numbers 
(Pm(n;)) converges to zero for all m € N. 


The application of a summable distribution T to a function n € $ is well defined 
since 


|(D* fi. n)| < sup |D*n(z)| a lfe(t) | d"t < œ. 


teER" 


This shows that it’s possible to extend a summable distribution to 8. However, the 
extension in general is not unique. The reason being that the set of test functions 
D is not dense in 8. That is to say that it’s not possible to approximate to arbitrary 
accuracy any function in 8 with functions from D. 


Example 6.1: Constant Function 


Consider the constant function 1: t + 1, the function a € D defined by (2.11), and 
functions a; € D defined by a;(t) = a(t/j), j € N. The product æ;1 is clearly a 
test function satisfying @ ;(2j) = 0 for all values of j. From this we see that 


po(aj1 = 1) = sup jO -1O = 1 


no matter how large j is. 


Example 6.2: Average Functional 

Consider the functions 1, œ; from Example 6.1 and the following functional L on 8 
1 re? 

Ln) i= Jim f ndr. 


C/2 


It is easily seen that L is linear and continuous. Its value on the constant function 1 
is 1 while its value on the test functions a ;1 is zero for all values of j 


ar i Aj 
|L(@j)| < dim = S laj(e)lde < dim = =0. 
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The functional L is therefore a valid extension to $ of the zero distribution as is the 
zero functional on $. 


We can define a unique, canonical extension of a summable distribution T by 
requiring an additional condition on the extension [19]. A suitable condition can be 
obtained from the properties of Lebesgue integrals. Consider again the test functions 
a; € D from Example 6.1 and a function 7 € 8. If we apply T to a;n we obtain 


(T, ajn) = > 1p" [ S(t) D* (æ; (t)n(T)) d"t 


|k|<m 


= Yel ( fi(t) D'n(t) d"t + 
ItlsJ 


|k|<m 


fil) Dijona) a") . 


Itl>J 


The integral of an L! function can be approximated up to an arbitrary € > 0 by an 
integral over a suitably chosen compact subset K of R”. Therefore we can find a 
large enough N such that for j > N 


(T,ajn)=e+ >> (-1)" f(t) D' y(t) d'r. 


\k|<m ItI<j 


Thus, in the limit as j tends to infinity we obtain a well-defined continuous linear 
functional on 8. 

The important observation from this derivation is the fact that, to find an extension 
to 8 of a summable distribution, it is not necessary to require uniform convergence 
on the whole of R”. An extension can be obtained by requiring uniform convergence 
on every compact subset K C R”. More precisely, by requiring the convergence cri- 
terion that we defined for the space &. From this observation we define the following 


property. 


Definition 6.3 (Bounded convergence property) A continuous linear functional on 
B has the bounded convergence property if, given any sequence (nj) of functions 
nj € B with pm(nj) < œœ for all m, and converging to zero in the space E as j — 00, 
then 

(T,;) > 0, lmao. 


The sequence (a@;7) does converge to 7 in &. Hence, by continuity, there is a 
unique extension of T to 8 with the bounded convergence property 


lim (T, an) = (T, n) . (6.1) 


jroo 


In particular this shows that this extension does not depend on the particular repre- 
sentation of T in terms of derivatives of integrable functions. 
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The converse is also true. The restriction to D of any continuous linear functional 
on 8 with the bounded convergence property defines a unique summable distribution. 
Thus, there is a one to one correspondence between summable distributions and 
continuous linear functionals on 8 with the bounded convergence property. 


Definition 6.4 (Canonical extension) The canonical extension to 8 of a summable 
distribution is the unique extension to a continuous linear functional on 8 with the 
bounded convergence property. 


In the following, whenever we use the extension of a summable distribution it 
will always be assumed to be the canonical one. 

While our previous definition of differentiation carries over to summable distribu- 
tions without problems, this is not the case for multiplication. In general the product 
of a bounded function 7 € 8 with an unbounded one y € Gis not bounded and there- 
fore not in 8. Differently from this, the product of two bounded functions n, ¢ € B 
is always in 8. Therefore, for summable distributions T € D, multiplication has 
to be restricted to functions in B 


(nT, 6) = (T, nf) . 


6.2 Convolution of Summable Distributions 


In Sect. 3.2 we defined the convolution product between two distributions S, T € D’ 
by 
(S* T, 6) = (S(t) ® TA), b(t +A)) 


and saw that in general, if the support of both S and T is unbounded, it may not exist. 
In this section we show that if S and T are summable, then their convolution product 
is well-defined despite the fact that their support is unbounded. 

Consider the application of a summable distribution T to a function tT œ> 7n(A + 
t) € B with à a parameter. Following the same arguments as in Sect. 3.1, given 
the linearity and continuity of T, we deduce that it is a continuous and indefinitely 
differentiable function ¢ belonging to 8 


GA) = (T(t), nA + T)) . 


For this reason the convolution of two summable distributions S and T is always 
well-defined 
(ST, n) = (SA), (T(t), nA + T))) = (S, ¢) 


and commutative. 
Next we investigate the convolution of a summable distribution T with a function 
in B. Consider the application of T to the function t +> (A — t) € B parameterised 
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by à. As we just saw, it is a function that we call again ¢ and that is clearly locally 
integrable. Hence, it defines a distribution in D’ and with @ € D we can write 


(TCE), nA = T), oa) = (p) = (6,5) = $), (TE), nA — T))) 


p(A) ® T(t), NA — T)) 


( 
= 
i f. bama- 1)a") 
= 


T(t), f. 6G + one) a") 
T(t) @n(), OE +2) = (T #7, 0) . 


or 
(T(t), nA — t)) = (T * mA) (6.2) 


This shows that a summable distribution can be regularised by a function in 8 and 
that the resulting regularised is also a function in $. 


6.3 Fourier Transform of Summable Distributions 


The functions t + e-/@-”) with œ € R” belong to 8. For this reason the Fourier 
transform of a summable distribution T can be expressed in a simple way. Let ọ € D, 
then 


(F{T}, $) = (T(t), F{O}(@)) = (re, Í b(oe d'o) 
= (T(t), ($ (œ), e 7) = (T(t) 8 6@), e0) 
= (6(@), (T(r), e 7) = (T(z), e0”), p) 
or 


F{T}(@) = (T(r), e 7%) . (6.3) 


F {T} is thus a continuous function. Moreover it has at most polynomial growth, for, 
by representing T as a sum of integrable functions and the properties of the Fourier 
transform, for some m € N we have 


IF {T}()| = |S) Vo) F{ fk) 


|k|<m 


A 


< Ý lof [ IAEI a't < C0 + lool)” 


|k|<m 
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with C a constant. Thus, the Fourier transformed of a summable distribution is a 
function of slow growth. 

The converse is not in general true, but we can find a class of functions for which 
it is. This is the set Oy, the set of functions of slow growth that are indefinitely 
differentiable. 

To see that this is the case, consider the Fourier transformed T of some tempered 
distribution T and assume that 7 € Oy. If p € OCS then its Fourier transformed 
ĝ as well as oT are in S. Therefore we see that 


xT =F 'OTI}ESCL! 
is a summable distribution and we can apply it to a function 7 € S to obtain 


($ * T, n) = (PA) @ T(t), NA+ T)) = (T(t), (PA), nA + T))) « 


Since @ € D and ņ € Bare arbitrary and (@(A), n(A + T)) € B we deduce that T is 
a summable distribution. 

We conclude this section by showing that the property of the Fourier transform 
of transforming convolution products into ordinary products is valid for arbitrary 
summable distributions. Let S, T be summable distributions, then using (6.3) and 
the property of the exponential function e~/+*- = e-/@%eJA-) one readily 
obtain that 


F{S xT} =F {S}F{T}. (6.4) 


The product is well defined as F {S} and F {T } are both functions. 
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Part II 
Systems 


Chapter 7 A) 
Convolution Equations get 


The objective of this chapter is to show that the solution of ordinary differential equa- 
tions, if based on distributions as opposed to functions, can be obtained by (mostly) 
algebraic methods. These methods are rigorous forms of the so-called Heaviside’s 
operational or symbolic calculus. The close relationship to the integral transforms 
that convert convolution into the ordinary multiplication is also shown. 


! Notation 


With this chapter we stop using uppercase letters such as T to denote distributions. 
Instead, we start using lowercase letters such as the ones typically used to denote 
functions, for example f. We also adopt the convention of denoting the Laplace 
transform of a distribution, say f, with the same letter, but changed to uppercase, e.g. , 
F = L{ f}. When we need to distinguish between the ordinary and the distributional 
differential operator, we will in general denote the former by i and continue to 
denote the latter by D. 


7.1 Convolution Algebra 


An algebra A is a vector space together with an associative product © such that 
multiplication of any two vectors produces another vector in A and such that for any 
constants a, b and any vectors f, g, h € A the following distributivity laws are valid 


(af +bg)Oh=a(f Oh)+ d(g Oh) (7.1) 
f © (ag + bh) = a(f Og)+b(f Oh). (7.2) 
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The convolution product seems like an adequate product to make an algebra out 
of distributions. Unfortunately, as we saw, the convolution product is not defined for 
arbitrary distributions. The solution is to restrict the set of distributions to a vector 
subspace of D’ on which the convolution is well-defined. 


Definition 7.1 (Convolution algebra) A convolution algebra A’ is a vector subspace 
of D’ with the following properties: 


e The convolution product is associative. 
e A’ with the convolution product forms an algebra. 
e ôisin H. 
A convolution algebra is thus an algebra with a unit and for which the product is 
always commutative. We also note that the triple (A’, +, x) forms a commutative 
ring. 

We have already met three examples of convolution algebras: (i) the set of right- 
sided distributions D’,, (ii) the set of periodic distributions and (iii) the set of distri- 
butions with compact support &. 


7.2 Convolution Equations 


In this section we study convolution equations. We will see that they provide a 
framework for studying a broad class of systems that is the time-domain counterpart 
of one based on the Laplace transform. 

A convolution equation is an equation of the form 


gxy=x (7.3) 


with g and x given distributions and y a distribution to be determined. In this section 
we assume g, x and y to be elements of a convolution algebra A’. Suppose that g 
has an inverse in A’, that is, there is an element denoted by ge! € A’ such that 


Then g*~! » x is a solution of the equation for any x, since 


y=g agy =g len. 


Note that if there is an inverse g*~! then it must be unique, since if ie is another 
inverse we have 


*—1 


gx(g* | — gh!) = (gxg*!)—(g «gi ') =0 


Conversely, suppose that (7.3) has a solution for any right-hand x. Then it has a 
solution for x = 6 and the solution is by definition the inverse of g. Consequently, 
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we can say that, if g has an inverse in A’, then the equation has a unique solution for 
any right-hand side x and the solution is 


y=gr tex. (7.4) 


Therefore, knowledge of g*~! permits to find the solution of (7.3) for any right-hand 
side x. For this reason g*~! is called the elementary or fundamental solution of the 
convolution equation. 

Note that if g has an inverse g*~', but it’s not an element of the convolution 
algebra A’, then the expression g*~! x x may not exist and g*~! x g * y may not be 
associative (see Example 3.5). Hence, (7.4) can not be proved to be equivalent to 
(7.3). 

Suppose that g; and g, are two elements of the convolution algebra A’ having 
inverses g! and ae respectively. Then their convolution product gı * go has an 
inverse as well and it is given by 


(g1* g2)* |= gf | * 95! (7.5) 
for 
(g1 * g2)* | * (g1 * g2) =ô 
*x—1 


= g1 kg] l kg * g) 
= (gp | * 83") * (81 * 22). 


From this we see that, if in (7.3) g can be represented as the convolution product 
of m invertible elements g;, i = 1, ...,m, then the solution of the equation can be 
expressed as the convolution product of their inverses 

*—1 


y=g eee kX. (7.6) 


In every algebra with a unit, one can perform a partial fraction expansion and every 
convolution algebra has a unit by definition. Therefore, every convolution product 
of inverses can be represented as a sum of inverses. 


Example 7.1: Partial Fraction Expansion 


Consider the following convolution product 
(D8 + a5)*~! x (D8 — b8)" ! 
with a and b different constants. Its partial fraction expansion has the form 


Ca( D8 + aô)! + cp(D8 — bd)*"! 
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with c, and c, constants to be determined. If we take the convolution of both expres- 
sions with 
(D6 + ad) x (Dô — bô) 


we obtain the following equation 
ô = Ca( D — bd) + cp(Dê + aô) . 


Equating the coefficients of 6 and Dô we obtain two equations for cg and cp whose 


solution is i 


a+b“ 


Cb = —Ca = 


A (convolution) algebra is said to be free from zero divisors if 
8ı*g2=0 
implies that either gı = 0 or go = 0. In this case the algebra is called an integral 


domain and a convolution equation with common factors on both sides of the equation 
can be simplified. For example, assuming f 4 0, the equation 


fxgexy=fx*x 


can be simplified to 
gey=x. 


In fact, the original equation can be written as 
f*(gey—x)=0 
and since f is different from zero, we can deduce the simplified form. 


We will see that the convolution algebra of right-sided distributions D’, is an 
integral domain. The algebra of periodic distributions D’(T) is not. 


7.3 Initial Value Problems 


In this section we want to apply the results of the previous section to study initial 
value problems. In particular let L denote the linear differential operator with constant 
coefficients of order m 


L = D” a (DO +---+aD +a 
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where for convenience we have set am = 1. We are interested in the solution of the 
differential equation 
Lyt) = x(t) (7.7) 


for t > O with initial conditions 
(DYO) =y k =0,...,m—1 (7.8) 


x and y functions and differentiation intended in the usual sense of differentiation of 
functions. 

As a first step in translating this problem into the language of distributions, we 
note that the convolution algebra D'_ is well suited for the study of initial value 
problems. Every element of the algebra can be thought of as being in a zero state for 
t < 0 and representing some excitation or state evolution for t > 0. The functions 
x and y can be associated with distributions of D'_ by extending them to negative 
values of t where we assign them the value of zero. To make this explicit it’s usual 
to show them multiplied by the unit step 14. 

The second step is to perform differentiation in the sense of distributions. With 
the results of Example 2.9, for the first derivative of 1, y we have 


DAs) yO) = 14()Dy(t) + yoô 
and similarly, for the higher order derivatives 


D'ALAYE) = 14D? y(t) + yoD + yd 


DALOY = 14D" y@) + yD" "5 +--+ ym—-18. 


Note that in all these expressions the first term on the right-hand side is the conven- 
tional derivative of the function y (multiplied by 1). Putting these results in the 
differential equation we obtain an equivalent equation for the distribution 14y 


m-1 
L(1}y) = 14Ly+ 5 op D48 
k=0 


m-—l1 


= 1x + a o,D‘5 
k=0 


with 


Ok = Aik Yo + A24kY1 F +t Vm—k-1 
m—1—k 


= > ainsi, k=0,...,m-1 (7.9) 
i=0 
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and am = 1. 

The last step required to translate the initial value problem into a convolution 
equation is to use the fact that the kth order derivative of a distribution can be 
expressed as the convolution product with D*é so that 


The initial value problem defined by (7.7) and (7.8) is therefore equivalent to the 
following convolution equation of distributions 


m—1 
Lô» 14y =14x+ ) o,D*s. (7.10) 
k=0 


With the results of the previous section, if the distribution Lô has an inverse in D’, 
(the elementary solution of the equation), the solution of the equation for arbitrary 
right-hand side 1x and initial conditions is given by 


m—1 


14y = (L8)*"! #14.x+ )° oD" [8]. (7.11) 
k=0 


It’s worth highlighting two important points. The first one is the fact that the 
differential equation (7.7) is not a full description of the problem. To fully specify 
the problem it has to be accompanied by the initial conditions expressed by Eq. (7.8). 
Differently from this, the convolution Eq. (7.10) is a full description of the problem. 

The second point that we want to highlight is the fact that (7.11) is a global 
solution of the problem, that is, the solution is specified for all times. Differently 
from this, the classical solution of the original initial value problem is a function 
only valid for t > 0. 

Next we show that the inverse (L5)*~! exists. To this end note that if we insert it 
in (7.10) and set x = 0 as well as og = 1 and og = 0, k = 1,...,m — 1 we obtain 
the equation defining the inverse 


Lé x (L&T! =ô. 
The inverse of Lô is thus the distribution 1e with e the function which is the solution 
of the homogeneous equation 
Le(t) =0 


with initial conditions 


D”'e(t)=1 and D‘e(t) =0, k=0,...,m—2. 
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Example 7.2: Fundamental Solution 
Consider the differential operator 

L=D+a. 
The solution of the homogeneous differential equation Le(t) = O with initial condi- 
tion e(0) = 1 is 

elt) =e. 
The inverse of Lô in the convolution algebra D’, is therefore 

(L8)! = (D8 +.a6)* !=1,(De. 


This is easily verified by inserting it into the convolution equation for the operator L 


(D8 +. aô) x 11 He" = DA Me™) + ale 
= —a1 (He +6+al.(ne" = 8. 


Lô * (L6)*~! 


In a similar way we find 


pl 


(Dô + a)” = LO Gat 


—at 
with m a positive natural number. 

Let’s focus for a moment on the distribution Lô and observe that it looks like a 
polynomial P with D6 playing the role of the independent variable 

Lô = D5 + am-1 D”7'8 + +++ + a, D8 + a8. 
Any polynomial can be represented as a product of factors 
P(z) = (z — z1) — 22) ++ — Zm) 
with z; the zeros that may or may not be distinct. From this and remembering that 
D‘8 x D'S = D**'6 

we deduce that the distribution Lô can be factored in a similar way. If we denote 
by f* the convolution product of k > 0 distributions equal to f with f*° = ô and 
group common factors, then Lô can be represented as 


LS = (D8 — z18)*"' x (D8 — 298)? * +++ & (D8 — 2,8)" 
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with /; the multiplicity of the jth factor. The inverse (L5)*~! can then also be factored 
(L8)*~! = (D8 — z,6)*" x (D8 — z28)*T x «++ & (DS — zô)" 


f**k denoting the inverse of f**. With this factorization the elementary solution can 
either be directly expressed as a convolution product 


hl phl t 
TOT a were 


or, by first performing a partial fraction expansion, can be expressed as a sum of 
convolution-free known distributions. 

To show the relation to the Laplace method, we Laplace transform Eq. (7.10). The 
Laplace transform of the distribution Lô becomes a true polynomial in the variable 
s and the convolution product becomes the conventional multiplication so that the 
convolution equation becomes an algebraic equation 


m-1 
P(s) ¥(s) = X(s) + J` os* 
k=0 
P(s) = (8 + Gms"! +--+ ais + ao) 
= paj e = 22)? +102 (8 a 


The Laplace transformed of the inverse (L5)*~! is the reciprocal of P (s) and corre- 
sponds to the Laplace transform of the elementary solution e 


1 


With it the solution of the convolution equation can be written as 


m-—1 


Y(s) = E(s)X(s) + E(s) 5 ons* . 


k=0 


The solution y of the original equation is then found by inverse Laplace transforming 
Y. In most cases this is most conveniently accomplished by partial fraction expansion. 

This shows the parallelism between convolution equations in D'_ on one side and 
the Laplace transform method on the other one. In particular the distribution D6 is the 
time-domain counterpart of the variable s, the convolution product the counterpart 
of the ordinary multiplication and ô the one of the multiplicative unit element 1. 
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Example 7.3 


Consider the differential equation 
[D? + (a — b)D — ab] y(t) = x(t) 


with initial conditions (Dy)(0) = y(0) = 0 and assume that a and b are different 
constants. The corresponding convolution equation 


(D6 + aô) * (Dô — bô) * y = x 
has as elementary solution the convolution product 
e = (D8 + a8)! * (D8 — b8)*! 


with partial fraction expansion (see Example 7.1) 


e= [- (D8 + a8)! + (Dd — b8)*"']. 


a+b 


The inverse elements appearing in e were calculated in Example 7.2. Using those 
results we can express the elementary solution of the equation as 


1 
e(t) = zip [-1,.(1) ew + 14.4) e”] n 


If we Laplace transform the equation, the procedure is completely parallel. The 
Laplace transformed of the elementary solution is 


Oe a a 
a aab Sta s—b 


and by inversion we obtain the same distribution e. 


We have seen that the initial value problem described by (7.7) and (7.8) can 
equivalently be described by the convolution (7.10). While the differential equation 
only has a meaning if x is a continuous function with isolated jump discontinuities, 
the convolution equation remain well-defined if 1x is replaced by any distribution 
in D’,. In particular, we can consider more general convolution equations of the form 


m—1 
Lô» y= Nô xx +Y o%D*5 
k=0 
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with 
N= b, D” + b,- D"! ++ bo ; 


x any distribution in D’, and where it’s understood that the solution y must belong to 
the convolution algebra of distributions in D’, . As before, the solution of the equation 
is found by convolving with the convolutional inverse of Lê 


m—1 
y = (L8)*"! x NS * x + 5 or ( LET! x DES. 
k=0 


We want to establish if it’s possible to replace the second summand on the right-hand 
side, representing the initial conditions, by a suitably selected input signal composed 
by a weighted sum of a Dirac pulse and it’s derivatives, such that, in the complement 
of t = 0, the solution y remains unchanged. To this end its convenient to consider 
the Laplace transformed of y 


Z m-l k 
ro = fixo + H n 


with Z = L{N ô} a polynomial of degree n and the other symbols having the same 
meaning as before. The Laplace transform of the sought for input signal is a poly- 
nomial 

X (s) = xs? + + x0 


and it must be selected in such a way as to satisfy the equality 


ZE) Eo as* 
Po O = PO + W(s) 


with W(s) another polynomial. This polynomial corresponds also to a weighted sum 
of Dirac pulses and its derivatives, and hence only changes y at t = 0, which we 
allow. 

The conditions for the existence of such an input signal X (s) can be determined 
with the help of the division theorem of polynomials. It states that, given polynomials 
Q(s) and P(s) Æ 0, there are unique polynomials R(s) and W (s) satisfying 


Q(s) = P(s)W(s) + R(s) 


with the degree of R(s) being lower than the one of P(s) [21]. From this theorem 
we deduce that, provided Z(s) and P(s) are relatively prime, that is, assuming that 
they have no common factors, we can select X (s) so that the rest of the division of 
Z(s)X(s) by P(s) corresponds to ons o,s*. To achieve this we need m degrees 
of freedom, one for each ox. In other words, the input polynomial X (s) must have 
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degree m — 1. Then we can choose the coefficients of X (s) in such a way as to obtain 
the desired values for the rest of the division. 
If Z(s) and P(s) have acommon factor K (s) then 


Z(s)X (s) . K (s)Z'(s)X (s) z K(s) / R(s) 
PO KP) =F (Fewo) 
KORG) 
= “Po + W(s) 


and we see that the rest of the division K (s)R(s) has a constrained form that can’t 
be made to match eae o,s* for arbitrary ozs. 

We have therefore established that, in a convolution equation derived from an 
initial value problem, the terms representing the initial conditions can be replaced by 
a distribution x composed by a weighted sum of a Dirac pulse and its derivatives if 
and only if Z(s) and P(s) have no common factors. If we perform this substitution, 
in the complement of t = 0, the solution of the equation y remains unchanged. 


Example 7.4: Replacing Initial Conditions 


Consider the initial value problem 


(D? +aıD +a) y = (bı D + bo) x 
(Dy)\0) =y, yO)=yo. 


The corresponding convolution equation is 
(D8? + a, Dé + aod) * y = (bı DS + bo) * x + yoDS + (aryo + y1)6. 


Our objective is to replace the initial conditions by an input signal composed by a 
Dirac pulse and its derivatives so that in the complement of t = 0 the solution y 
of the convolution equation with this input signal is identical to the solution of the 
equation with initial conditions and no input signal. 
Expressed in the Laplace domain the problem is thus to find the coefficients of 
the polynomial 
X(s) = xs + x9 


such that 
Z(s)X (s) = R(s) 


P(s) — P(s) 


+W(s) 
with 


Z(s) = bis + bo, P (s) = s$?” + as +a, R(s) = yos + a1 yo + yı 


106 7 Convolution Equations 


and W(s) an arbitrary polynomial of degree lower than 2. By performing the poly- 
nomial division of the left-hand side of the equation we obtain 


S(—ay b,x; + box, + bixo) — agbix1 + boxo 


+ b,x. 
s2? + ais + ao 1e 


Thus W (s) = b,x, and, by comparing coefficients of this expression with the right- 
hand side of the equation, the coefficients of X (s) are found to be 


_ (abi — bo) yi + [(a? — ao)bi — aibolyo 
aob? _ aybob; + b? 
biyı + (aıbı — bo) yo 
X1 = 7 7 
agb; —aybob, + bo 


Xo 


This solution is well-defined except when the denominator, which is the same for 
both x; and xo, becomes zero. This happens when 


aob b 
a = a 
bo by 


In this case the polynomial Z(s) becomes a factor of P(s) 


aobı 


24 (2P , Po) | bolic 32) 
S Sei =s a = AY —S ae 
bo bi 0 1 0 bı bo 


in accordance with our general treatment of the problem. 


Before concluding this section we show the important fact mentioned before that 
the convolution algebra D', has no zero divisors. To see this, consider a test function 
@ that is real-valued and positive everywhere on its support, for example 6, from 
Example 2.1. We call such a function a positive test function. In Chap. 3 we saw 
that every distribution can be represented as the limit of a sequence of indefinitely 
differentiable functions. Let (gm) and (ym) be such sequences converging to g and y 
respectively and, for simplicity, assume that all functions are real-valued. Then, for 
every m there exists an open interval U contained in the support of gm where, for 
every positive test function ¢ with support in U, its value always has the same sign, 
for example positive 


(8m, ¢) >0. 


We can make a similar construction for y; as well. In addition, we can introduce a 
parameter à such that 
à > (Yi), í +A)) 
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is a positive (or negative) test functions of A with support in U. Then, assuming again 
a positive sign, 


(8m * Yis Ø) = (8m(A), (Yi), O(T + A))) 
must be positive for every m and i and, with the continuity of distributions and 


convolution, so must be the limit. Consequently, if g x y vanish for every test function 
then either g or y must be the zero distribution. 


7.4 Integro-Differential Equations 


Some initial value problems are naturally formulated as ordinary integro-differential 
equations 


D” y(t) + Am—1D™y(t) +.---+a,Dy(t) + aoy(t) 
t t Tn-1 
taf ydunt tan | f Y(Tn) dt +++ dT 
0 0 0 
= x(t) 
with initial conditions 
(D* y)(0) = yk k=0,...,m—1. (7.12) 


We still need initial conditions, but this time only m of them as the remaining infor- 
mation is included in the integrals. 

These problems can be converted into convolution equations in the convolution 
algebra D’, in a similar way as we discussed before. The new terms are the ones that 
are expressed as integrals and these can be written as convolution products 


t 
[ vevan 
0 
l 7 Sit 
T -f (Ty) dT,- dT) 
0 0 


The corresponding convolution equation is therefore 


14.(t) * 14. y@) 


I7O «LOO. 


(D"5 + dm—1D"'5 + +++ +a, D8 + aod 
+aaly +H ant) * 14. OVO 


m-1 


= 1,()x() +Y oD‘ 


k=0 


108 7 Convolution Equations 


with oz, k = 0,...,m — 1 as defined in (7.9). 

As we have seen, the convolution algebra D’, is an integral domain. For this 
reason we can multiply both sides of the equation with a non-zero distribution without 
changing the result. If we choose D”6 as the distribution and make use of the fact 
that 1, (t) is the inverse of Dé 

Déb*14=6 


the equation becomes 


(p™*"65 + amı D™ "3 ees -a_n6) * 14.(t)y(t) 
m—1 

= D" x 14 x(t) + > o, D‘*"§ , 
k=0 


This is the type of convolution equation that we discussed in Sect. 7.3 and is solved 
by the same method. The solution of integro-differential equations thus requires no 
new technique. 

The procedure of transforming the convolution equation that we just discussed 
is similar to the standard procedure used to convert an integro-differential equation 
into a differential equation by differentiating the equation. The key difference is that, 
while the former handles initial conditions automatically, the latter method requires 
extraction of additional conditions from the original equation. 


7.5 Periodic Solutions 


One is often interested in periodic solutions of differential equations. These solutions 
are most conveniently found with the help of the convolution algebra of periodic 
distributions. 

Consider again the convolution equation obtained from the differential operator 
L of Sect.7.3 where now the unit element of the algebra is the Dirac comb ôy 


Lép * y =X. 


In Sect. 4.5 we established two important properties of the Fourier series: 


1. The first one being that the Fourier coefficients of the convolution product of two 
Fourier series is equal to the product of the coefficients of the individual series 
times the period (Eq. (4.20)). 

2. The second one being the fact that differentiation corresponds to multiplication 
of the kth Fourier coefficient by the factor Jkwe with we = 27 /T . For this reason 
the kth Fourier coefficient of the distribution L467 is proportional to a polynomial 
P evaluated at kw, 


7.5 Periodic Solutions 109 


1 


celLôr) = [Gko)" + ama (kwe)! +--+ a (koe) + ao] = 


1 
= P(jko.)—. 
Uko) 


By representing both x and y by their respective Fourier series and using these two 
properties, we can transform the above convolution equation into algebraic equations 
for the Fourier coefficients. Let’s denote by cg the kth Fourier coefficient of x and 
by dx the one of y. Then the equation becomes 


(ska. — z1)" (gk@e — z2)? -+> (gk@e — Zn)” dg = ck 


where, as before, we have expressed the polynomial P by its zero factors. To solve 
the equation we have to distinguish three cases: 


1. If one or more zeros z; of the polynomial equals jkw, for some integer k and the 
coefficient cz of x is different from zero, then the equation has no solution. 

2. If one or more zeros z; of the polynomial equals jk, for some integer k and 
the coefficient cg is zero, then the equation has an infinity of solutions. In fact 
in this case d can be any number. Note also that if z; is equal to jk@, then the 
convolution product 

Lôr * elet =0 


vanishes, which means that the convolution algebra of periodic distributions has 
zero divisors. 

3. If no zero z; equals ykw, for any value of k then the equation has the unique 
solution given by the Fourier series with coefficients 


— P(Jko) ` 


dk 
Example 7.5: Cont. of Example 7.3 


We look for a periodic solution of the convolution equation of Example 7.3 
(Dôr + ad) * (Dôr — bêr) * y= x 


assuming that the real part of a and b are both positive. In particular, we are interested 
in the elementary solution e of the equation. By setting x = ôy and expanding it by 
its Fourier series we obtain the following equation for the kth Fourier coefficient of e 


1 1 
TT (gka. +a)(Jkos— b) ` 


Ck 
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By performing a partial fraction expansion and with the help of (4.24), we recognize 
them as the coefficients of the Fourier series of the distribution 


e(t) = g(t) * ôr 


with 


rar: [ee +1, e"] . 


In fact the Fourier transform of g is 


z 1 —1 1 
g= a+b t] 
_ 1 
~ (Jæ +a)(Jæ — b) ` 


Note that g is a distribution of slow growth. The elementary solution of the equation 
in the algebra of periodic distributions is therefore the sum of periodically shifted 
tempered solutions of the differential equation. 

Suppose now that we are interested in the solution for x(t) = Ae/®’. The only 
Fourier coefficient of x different from zero is cı = A. The Fourier coefficients of y 
are then also all zero except for 


dı = Tc e = A &(@) è 
In this case the solution y of the equation is therefore 


y(t) = A Bla) e. 


7.6 General Convolution Equations 


7.6.1 General Solutions 


In this section we consider generic convolution equations of the form 
§*¥y=xX 


with g, y and x generic distributions in D’. Here the situation is different from 
when working in a convolution algebra. First the convolution between g and y may 
not exit. To guarantee its existence g must have compact support. This includes 
many important cases, for example, all linear differential operators with constant 
coefficients. 
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Second, g may not have an inverse. For example, if g € D we have seen in Sect. 3.2 
that g x y € & and so can’t equal 6 for any y € D’. If it does, then the equation has 
an elementary solution, but it only serves to find solutions for x having compact 
support, otherwise the last convolution in 

— ot! — «xl 
Y =g *8*Yy=g *X 
may not make sense. 

Further, the homogeneous equation 


gxy=0 


may have solutions different from y = 0. For this reason there may be an infinity 
of elementary solution, two of them differing by a solution of the homogeneous 
equation. 

Despite these facts, general convolution equations have many practical applica- 
tions. 


Example 7.6: Electrostatics 


Let p denote the electric charge density and u the electrostatic potential, both func- 
tions of the position in space. In empty space the two quantities are related by 
Poisson’s equation 
_ p(x) 

€0 


Au(x) = 


with A the Laplace operator, x € R? the vector specifying position and €o the per- 
mittivity of free space. This equation can be written as a convolution equation 


Ad xu = ap ; 
€0 


One can show that the inverse of Aô is 


1 
An |x| ` 


If the charge density p is distributed over a finite region Q C R? then the generated 
potential is 


1 


The homogeneous equation has solutions different from the trivial one: the so- 
called harmonic functions. 
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7.6.2 Tempered Solutions 


If x is tempered and one is interested in tempered solutions of the equation then the 
convolution equation has a sense not only for g of compact support, but for the larger 
class of distributions of rapid descent O% [16]. This case is particularly important 
because one can then use the Fourier transform which may make it easier to find a 
solution. 

In the following we briefly consider the one dimensional case where g is a linear 
differential operator with constant coefficients L and the convolution equation has 
the form 

Lox y=x. 


In this case there always is at least an elementary solution. If we Fourier transform 
both sides of the equation we find the equivalent equation 


P$=î 


with P a polynomial (and thus in Ôm). 

If P has no zeros, then the only solution of the homogeneous equation is the 
trivial one and the inverse of P is a function of slow growth 1/P € Oy. The only 
elementary solution of the equation is therefore the summable distribution 


e= Fo) 
me 


If P has a zero at œw, then the homogeneous equation has nontrivial solutions. In 
particular, we saw Sect. 2.5.1 that if the multiplicity of the zero is k then the sums 


k-1 


2 Cm D"5(@ — Wp) 


m=0 


with cm constants, are all solutions of the Fourier transformed homogeneous equation 
P $ = 0. The solutions of the original homogeneous equation are found by inverse 
Fourier transformation to be 


» Cm i" elr! 
TT 


The equation has therefore an infinity of elementary solutions. In addition, since 
1/P ¢ Oy, the solutions are not summable distributions. 

Note that the equation may have non-tempered solutions that are not captured by 
Fourier transform techniques. 
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Example 7.7: Cont. of Example 7.3 


We look for a tempered solution of the convolution equation of Example 7.3 
(D6 + ad) * (Dô — bd) x y = x 


assuming that the real part of a and b are both positive. A tempered elementary 
solution is easily found by solving the Fourier transformed the equation 


1 1 
~ P(@)  (ja+a)(jo—b)’ 


and determining its inverse 


e(t) = atb ERO, ew + 14(-1t) e] à 


Note that despite the similarity between ê(w) and E (s) of Example 7.3 the tempered 
elementary solution is different from the solution found in the convolution algebra 
D. 

Since P (w) has no zeros, e is the only elementary tempered solution of the equa- 
tion. Other solutions obtained by adding any linear combination of the solutions of 
the homogeneous equation (e~” and e”’) growth exponentially as t tends either to 
oo or to —oo and are therefore not tempered distributions. 
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One often has to solve a set of n simultaneous equations in n unknown distributions 
Yl» ---> Yn 

Sir * yi + 812 * y2 +++ + Bin * Yn = X1 

821 * yi + 822 * Y2 +++ + 82n * Yn = X2 


8n1 * Y1 + 8n2 * Y2 + -° + San * Yn = Xn 


with g jm coefficients distributions, x1, . . . , x, right-hand side distributions and where 
all distributions belong to a distribution algebra A’. This system of equations can 
conveniently be written in matrix form 


GxY=X (7.13) 


with G the n x n matrix with elements gj,, and Y, X vector valued distributions 
(column matrices) with elements y; and x; respectively. The space of vector valued 
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distributions is denoted by D’(R”, C”) and application of a test function @ € D(R”) 
to a vector X is defined as the application of ¢ to each component individually 


(xı ’ Q) 
(X, ġ) := : 
(Xn, $) 


The determinant of the matrix G is defined as usual, with the convolution product 
replacing the standard product. It is a convolution belonging to the convolution 
algebra A’. For example, the determinant of a 2 x 2 matrix G is 


det k | = 811 * 22 — 821 * 812- 
§21 822 


Suppose that the matrix G has an inverse G*~! 
G x G77! = ôI 


where 6/ is the identity matrix with the unit of A’ on the diagonal and 0 everywhere 
else. If we compute the determinant of both sides of this equation we obtain 


det(G x G*7!) = det(G) x det(G*~!) = det(61) = 6 


from which we deduce that, if the matrix G has an inverse then det(G) has an inverse 
in A’. Conversely, if det(G) has an inverse, then we can compute the inverse of G 
by 

G*"! = det(G)*! + G7 


with G the matrix of cofactors and G” its transpose. 
We conclude that (7.13) has a solution for arbitrary right-hand side X if and only 
if det(G) has an inverse in A’. The solution is given by 


Y=G* !«X. (7.14) 


One shows in a similar way as for a single equation (see Sect.7.2) that G*~! and 
hence the solution of the equation is unique. 
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Chapter 8 A) 
Linear Time Invariant Systems geai 


We assume the reader to have familiarity with linear time-invariant (LTI) systems. 
In this chapter we merely summarise the main results of this theory. We are going to 
call the quantities that are considered the input, the output and some characterization 
of the system signals. This should evoke a meaningful interpretation in most of the 
systems that we are going to discuss. Mathematically they are distributions. 


8.1 Basic Definitions 


The meaning of time-invariant is very intuitive: suppose that we apply the input 
signal x(t) to a system represented by an operator H and observe the signal 


y) =HAlx(t)] 


as its output (Fig. 8.1). The system is said to be time-invariant if by applying the 
delayed input signal x(t — tT) we observe the same output signal as before, except 
for a delay in time by an amount 7, that is, if 


y(t — tT) = H[x(t —T)]. (8.1) 


The concept of linearity is subtler. A defining property of a linear system is the 
validity of the superposition principle: if yı (t) is the response of the system to the 
input xı (t) and y2(t) the one to x2(t), then the response to a linear combination of 
these inputs is 


yt) = Hleix E) + c2x2(1)] = a Hix ()] + Hix] 
= ciyi(t) + c2y2(t) (8.2) 


with cı and cz constants. However, if we limit the definition of a linear system to this 
property, then we admit pathological systems as the following one. 
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x Pr h m y 


Fig. 8.1 Representation of a single-input single-output LTI system H 


Example 8.1: A Discontinuous System [22] 


Consider a system accepting as input a piece-wise continuous function with at most 
a finite number of isolated jump discontinuities. The system response consists in the 
sum of the input signal jumps from —oo to the present time t. 

The system satisfies (8.2). However, the behaviour is rather peculiar. If we apply, 
say, a rectangular input then the output is also rectangular. But, if we approximate 
to any degree of accuracy the rectangular input with a continuous function, then the 
output is always zero. 


To exclude systems with such a bizarre behavior, we require linear systems to be 
continuous: if as m € N tends to oo the sequence of input signals Xm (t) converges 
(in the sense of distributions) to the signal x(t), then the system response ym(t) 
corresponding to input Xm (t) converges to the response y(t) corresponding to x(t). 

Suppose that we apply an impulse ô(t) to the input of the system H and observe 
the signal A(t) at its output. Then, by linearity, if we apply a finite number of pulses 
the output must be 


n n 


AL) aj S(t — t] = Yo aj h(t rj) = ht) Y aj S(t — 1). 
j=l 


j=l j=l 


In Sect. 3.3 we saw that every distribution can be represented as the limit of a finite 
series of Dirac impulses. From this and the linearity of convolution (Eq. (3.19)) we 
obtain that, in the limit as n tends to infinity, if the input converges to the signal x(t) 
the output of the system converges to 


y(t) = h(t) * x(t). 


We therefore define 


Definition 8.1 (LTI System) A single-input, single-output (SISO), linear time- 
invariant (LTI) system is a system that, when driven by an input signal x(t) produces 
the output 


y(t) = h(t) * x(t) (8.3) 
with h(t) the impulse response of the system. 


A system is called real if, when driven by a real distribution, its response is a real 
distribution. In other words, if its impulse response is a real distribution. 
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While we have been talking about signals depending on time, we can abstract 
from that and talk about signals depending on a generic n dimensional independent 
variable à € R”. In this case, instead of time-invariance, it makes more sense to adapt 
(8.1) to 

ya-—t) =H[x—7)] 


and talk about translation invariance. A single-input single-output, linear translation- 
invariant system is then still described by a convolution product similar to (8.3) where 
however the independent variable t is replaced by the abstract n dimensional variable 
à. We are going to call a system of this type an LTI system as well. 


8.2 Causality 


Assume for simplicity that h and x are integrable functions of time. The response 
of a system characterized by h when driven by the input x can then be written in 
integral form 


y(t) = f h(t)x(t—t)dt= f h(t —t)x(t)dt. 


Suppose now that the input vanishes for t < 0. Then from 


(oe) 


y(t) = [rc —t)x(t)dt 


0 


we see that in general the system may produce a nonzero response y(t) for t < 0, 
that is, before the input signal x(t) has been applied. 

If a system is causal, that is, if its output at time tọ can only depend on values of 
the input signal at times £ < fg, then its impulse response h(t) must vanish for t < 0. 
In other words h must be a right-sided distribution in D’, . 

Note that in our interpretation of signals as being functions of time, non-causal 
systems are not physically implementable and appear to be meaning-less. However, 
non-causal systems are sometimes useful in theoretical studies. In addition, in many 
situations the theory of LTI systems can be applied to systems where the quantities 
of interest (the input and output) are not functions of time (see Example 7.6). 
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8.3 Stability 


An important aspect of a system is its stability. Let x(t) be a bounded function, that 
is, satisfying 
IIX lloo = sup|x(t)| < 00. 
teR 


The response of a system characterized by the impulse response A(t) to such an input 
signal is 
y(t) = h(t) * x(t). 


The output y(t) is well-defined if 
(hx x,@) < œ 
for every test function @ € D and for every sequence (m) converging to zero in D 


lim (h * x, Gm) =0. 


m—> Ooo 


In this case we say that the system is bounded-input bounded-output (BIBO) stable. 
For a system to be BIBO stable 


(h(t) * x(t), @(@)) = wo. f x(t)b(t + t) dt) 


must have a meaning. Observe that the inner integral is an indefinitely differentiable 
bounded function. For the convolution to have a meaning the impulse response of 
the system must therefore be extensible to a continuous linear form on 8. As we saw 
in Sect. 6.1 this is only the case if h is a summable distribution. Thus, for a system 
to be BIBO stable, its impulse response must be a summable distribution. 

We mention without going into details that the definition of a BIBO stable system 
can be extended to input signals that are so-called bounded distributions and usually 
denoted by 8’ or Di... [16]. 

The series connection, or cascade of two stable systems results in a stable system. 
This is so because the convolution of summable distributions is always well-defined 
and is itself a summable distribution. In addition, for linear systems the order of the 
connection is irrelevant as, if h 4 and hg are the impulse responses of the two systems 


haxhgp=hgpxha. 
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8.4 Transfer Function 


8.4.1 Stable Systems 


If a system is stable then its impulse response h can be Fourier transformed and the 
transformed Å is a continuous function of slow growth called the frequency response 
of the system. If the input signal x is also a summable distribution then it can also be 
Fourier transformed and the Fourier transform of the output signal can be represented 
by the product . 

$o) = hw) w). (8.4) 


If the input signal x is J -periodic, then the system can be analysed in the convo- 
lution algebra of periodic distributions. To do so the impulse response h is converted 
in a periodic distribution by convolving it with the unit of the convolution algebra of 
periodic distributions 67 


Provided that hz is well-defined, which for stable systems is always the case, then 
the output of the system can be represented by 


y=hr «x. 


Note that while the convolution used to define hz is the convolution in D' (R), the 
latter is the convolution in D'(T). As discussed in Sect. 7.5, the equation is most 
conveniently solved with the help of the Fourier series. If we denote by cm (y), Cm (Av) 
and Cm (x) the mth Fourier coefficient of y, hy and x respectively, then the equation 
is solved if 


Cm(y) =T em (Ae em (x) 
for every m € Z. From (4.24) we know that 


(moe) 
T 


cm(hr) = 


with we = 27/7. Therefore, by knowing the Fourier transform of the impulse 
response we can immediately obtain the Fourier coefficients of the output signal 
by 7 

Cm(¥) = h(Mmoc)cm (x) . (8.5) 


In particular, if the input is the complex tone e/“‘, the output is also a complex tone 
at the exact same frequency 
y(t) = hlode. 
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If the input of the system is the sum of two (or more) periodic signals x4 and 
xg with incommensurate frequencies w, and wz, that is, if the ratio of the two 
frequencies w4 /@g is an irrational number, then the input signal is not periodic, but 
almost periodic. Due to the linearity and continuity of the system, the response can 
still be calculated by the above technique for each input separately and the result 
combined 


o0 


y(t) = > h(ma@a)Cm (xaje "e + í (mog)cm(xg)e f 


m=— 00 


8.4.2 Causal Systems 


If the system is causal, that is, if its impulse response A is a distribution in D’, , and 
one is interested in the system response for right-sided input signals x € D’, , then the 
system response y can be calculated in the convolution algebra D'_. In particular, if 
h and x are Laplace transformable then the Laplace transformed of the output signal 
can be calculated by 

Y(s) = H(s)X (s). (8.6) 


The Laplace transformed H (s) of the impulse response h is called the system transfer 
function. 
If the system is BIBO stable, then the ROC of H (s) includes the imaginary axis 
s = jæ. In this case the Fourier transformed of h is immediately obtained from the 
transfer function by . 
h(@) = H(Jøæ). (8.7) 


Note that if the system is not BIBO stable then this relation is not valid even if the 
Fourier transform of h does exits. See Example 5.4 for a simple example where the 
system corresponds to an ideal integrator. 

In the following we are going to denote distributions belonging to D', N Di, by 
Diy 


8.5 Rational Transfer Functions 


Consider a causal system described by a rational transfer function 


N(s) _ bns” F bys"! eG! bo 
O P(S) O s” + amas”! $+ + ao 
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Given the Laplace transform X (s) of the input signal x, the Laplace transformed of 
the output is 
N(s) 


P(s) 


Y(s) = X(s). 


If we multiply both sides of this equation by P(s) we obtain 
P(s)¥(s) = N(s)X(s) 
and by inverse Laplace transforming the equation we obtain the convolution equation 


(D"8 +a 1D" +--+ aod) * y 
= (bm D” 8 + by D”718 +--+ + boô) * x. 


With the results of Sect. 7.3 we see that this equation corresponds to the initial value 
problem described by the linear differential equation with constant coefficients 


Ly(t) = xXa(t) 
with 


L = D” + am-1 D”! +- +a, 
Xa(t) = (b, D” FT bn- D"! ee bo)x(t) 


and zero initial conditions 
(D‘y)(0) =0, k=0,---,m—1. 


For this reason y(t) = h(t) * x(t) is called the zero state response of the system. 

It is obvious that the procedure can be reversed. We have therefore established a 
one-to-one correspondence between systems described by a rational transfer function 
and systems described by a linear differential equation with constant coefficients and 
zero initial conditions. 

If the transfer function H of the system is minimal, that is, if its numerator and 
its denominator are relatively prime polynomials, then, in the complement of t = 0, 
it is possible to recreate the same output that would be produced by solving the 
corresponding initial value problem with non-zero initial conditions. This is achieved 
by driving the system with an input signal consisting of a weighted sum of a Dirac 
pulse and its derivatives 


X = Xm_1D" 18 +--+ + x08 
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and by suitably selecting the weighting coefficients xo, ...,X;,—1; as described in 
Sect. 7.3 (see Example 7.4). Such a system is said to have order m and to be observ- 
able and controllable (see Sect. 8.6). 

If H(s) is a proper rational transfer function, that is if m < n, then it can be 
expanded into a sum of partial fractions of the form 


C ik; 
(s — p“? 


with p; the jth zero of P(s), Lj its multiplicity and c;,, constants. From Example 7.2 
and the properties of the Laplace transform we therefore see that the impulse response 
h is the sum of products of polynomials and exponential functions. In particular, we 
see that the system is stable if the real part of the poles of H(s) are negative 


R{pj} <0. 


If n is not smaller than m then H (s) can be decomposed into the sum of a poly- 
nomial and a proper rational function. The impulse response / is then the sum of the 
above polynomial-exponential functions and a weighted sum of Dirac impulses and 
its derivatives. 


8.6 System State 


In this section we review the concept of the state of a system. To this end consider 
the initial value problem described by the system of n differential equations 


d 
—u = Åu +x, u(0) = ug € C” 
dt 


with A € C”*” ann x n matrix and u and x n dimensional vectors of complex valued 
functions of time. As before we can translate this initial value problem in the language 
of distributions by replacing the (conventional) derivative with the distributional one 
and work in the convolution algebra of right sided distributions 


Du = Au + uô +x. 


If we rearrange the equation and convolve each term with 71 we obtain the equiv- 
alent equation 
(I8 — A14) xu = 114 * (upd + x). (8.8) 


This form shows that the equation can be solved by left convolving both sides of 
the equation with the inverse of (78 — A14). Observing the analogy with the geo- 
metric series, provided it converges, the latter can be represented by the following 
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series, where the standard product of the geometric series has been replaced by the 
convolution product 


(16 — A14)! = 15 + A14 + (A154)? +- 


The iterated convolutions are easily evaluated 


t”! 
A1 *n Angee = An 
os? + (n— 1)! 
and using the identity 


1% = 19 a 8 hee DOS Ve DS 


we obtain 


œ n—1 t" 


CO 
= n t n 
(I8 — A1,)* '=18+) A"- — =) A"— 1, x Dô. 
a (n )! £ n! 


The last series can be expressed with the help of the exponential matrix defined by 


[00] 
At n t” 
= A” — 8.9 
e 2 F (8.9) 
which converges for every value of t 
(16 — A1,)*"! = 14e“ x Dô. (8.10) 


Having established the convergence of the series, using the linearity and continuity 
of convolution one readily sees that indeed it defines the desired inverse 


(16 — A14) * [I8 + A1, + (A414)? + ---] = 18. 
The solution of the equation is therefore given by 
u = 1,04! x I (D8 x 14) * (upd + x) = 1h“ ug + 146^ ex. (8.11) 


The exponential matrix has several useful properties that are immediately verified 
using its defining series 
eAteAt — Ate) AO 


(e4)! = e74 De“ = Ae“! = e“ A . 
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Note however that in general 
efe? £ e^t8, 


This is only valid if A and B commute, that is AB = BA. 
Consider now the state space representation of a SISO LTI system 


Du = Au + uô + Bx, Aec" Fec™ (8.12) 
y = Cu + Dx cect! Dect (8.13) 


where now x represents the input signal of the system and y its output. The vector u 
is called the state of the system and (8.11) shows that it’s value uo at a given point in 
time fo is the minimum amount of information required that together with the input 
signal at times t > t, allows determining the system behaviour at all future times 
t > tọ. In other words, the system state uo at time tọ summarises the effect on the 
system of all past values of the input signal and of previous states. 


8.6.1 Controllability 


It’s interesting to ask if it’s possible to design the input signal in such a way that the 
system can be set in an arbitrary state uo in finite time. That is, can we design the 
input signal such that for t > f the state vector equals u(t) = e4’ ug? 

The problem is most easily analysed using impulsive inputs, starting from the 
zero state. From the above results we know that the system state dependence on the 
input signal x is given by 

u = 1e B xx. 


Suppose that for an n dimensional system we use an input signal consisting of a 
weighted sum of a Dirac impulse and its derivatives up to order n — 1 


x = xo +x DS +--+ xn 1 D”!8. 
Since the system is linear, we can analyse the contribution of each term individually 
11e“ B x x95 = 14e“ Bxo 
1e“ B xxi Dô = D(1,e" Bx,) = 1,e“ ABxı + 6Bx, 
14e“ B x x,;D" 15 = D! (14.e“ Bxn—1) = 146e% A" | Bxn1 + 
The terms replaced by dots on the last line are constituted by a weighted sum of a 


Dirac impulse and its derivative which are zero for t > 0. Putting all terms together 
we obtain for t > 0 
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X0 


xX] 
1e“ xx = 10" [BAB ... A"'B]. 


Xn-1 


From this we conclude that we can use a suitably designed input signal x to mimic 
the effect of an arbitrary initial state uo if and only if the matrix 


C := [B AB... A"-'B] (8.14) 


is invertible, in which case the weighting factors are 


Xn-1 


The matrix C is called controllability matrix. 

While the state of a system plays an important theoretical and conceptual role, in 
practice, when dealing with controllable systems we can always start from the zero 
state and drive the system in any desirable state. Things are completely different for 
non-controllable systems. As discussed in Sect. 8.6.3, these are systems possessing 
sub-systems that are not influenced by the input signal. In those systems the initial 
state may play an important role. 


8.6.2 Observability 


Another interesting question is whether it’s possible to reconstruct the initial state of 
a system at time tọ from the observation of its output at times t > tọ assuming that 
A, B,C, D and the input signal x are known. From linearity and knowledge of the 
input signal we can assume x to be zero. (Alternatively we could compute the part of 
the output signal due to the input signal—the zero state response of the system—and 
subtract it from the observed output.) The question is then if we can calculate uo 
from the observation of 
y = Ct ye“'ug. 


Suppose that the system is n dimensional. Then if we compute the first n — 1 
derivatives of the output signal we obtain 
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Dy = C1,e^ Auo + Cduo 


D"y = C1,e4 A"! 9 LERIN 


where in the last equation we have represented by dots a weighted sum of a Dirac 
pulse and its derivatives as before. Thus, the observation of the output signal and of 
its first n — 1 derivatives at times t > 0 allows setting up the following system of 
equations 


y(t) C 
Dy(t) CA 
lim N = A “Ug. 
t>0+ : : 
D"-'y(t) CA"! 


This system of equations can only be solved for uo if the matrix 


CA 
Ô := : (8.15) 


CA"! 


is not singular. The matrix O is called the observability matrix. 


8.6.3 Jordan Normal form 


The simplest way to understand the structure of a system that is either not controllable, 
or not observable is by considering the system in Jordan normal form. 
Consider a system in the state space representation 


Du = Au + Bx, AeC’™ Bec™! 
y=Cu CECE, 


In linear algebra is shown that, by choosing a suitable basis, every linear operator 
can be represented by a matrix of the following block form, called the Jordan normal 
form 


Ji 0 


8.6 System State 129 


-0—00 HO 


uo 


y 
x — = (bi) a7 =| 1.(f) "s =( cı =)> y 


(2) O 140 


Fig. 8.2 Jordan normal form representation of a system 


u2 


with 


the elementary Jordan matrix. The diagonal elements of J; correspond all to the ith 
eigenvalue 4; of A. If n; correspond to the algebraic multiplicity of eigenvector A; 
and v; to its geometric multiplicity, then there are v; Jordan blocks J; corresponding 
to eigenvalue À;. Thus, the total number of Jordan blocks corresponds to the number 
of independent eigenvectors of A. The Jordan normal form of a linear operator is 
unique up to permutations of the blocks. 

A matrix for which the geometric multiplicity equals the algebraic multiplicity 
for each eigenvalue is called semi simple. In this case each block J; isa 1 x 1 matrix 
and the Jordan normal form reduces to diagonal form. 

A system in Jordan normal form can be interpreted as the parallel connection of 
independent sub-systems, each represented by a Jordan block J;. Figure 8.2 shows 
the block diagram for a system with a simple eigenvalue Ao and a double eivenvalue 
A; with v; = 1. From the figure it’s easy to see that if bọ = O then the state variable 
ug can’t be excited by the input signal x. The same is true for u2 if b2 = 0. In either 
case the system is not controllable. One can check that these are the two conditions 
under which the determinant of the matrix C vanishes. 

In a similar way the figure shows that if co = O there is no path from uo to the 
output of the system and for cı = 0 there is no path from u. These are the two cases 
under which the system is not observable and correspond to the two conditions under 
which the determinant of the matrix O vanishes. 
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Fig. 8.3 a Not controllable a b 
system. b Not observable hp ia hp 
system Qo» y x 
x —S ha La ha |> y 


From these considerations we conclude that a non-observable system includes a 
sub-system whose output does not reach the global system output as schematically 
depicted in Fig. 8.3b. A non-controllable system includes a sub-system that is not 
reached by the input signal as schematically depicted in Fig. 8.3a. 


Example 8.2: Jordan Block 


Consider the system described by the following state-space representation 


Du = Au + Bx 
y = Cu 


with 


_ |@sap 1l _ [bo _ 
a=] 0 sal al i; C= [cc]. 


We want to compute an explicit expression for the exponential matrix e’4 allowing 
us to compute the response of the system to an arbitrary input signal x. 


The matrix 
wzxap l 
A= 
| 0 a 


is an elementary Jordan matrix and can’t be transformed in a diagonal matrix by a 
similarity transformation. In fact, as can be seen from the characteristic polynomial 


det(A — AI) = (sap — A)’, 


the matrix has a single eigenvalue A = w3gg with an algebraic multiplicity of 2 and 
the eigenspace belonging to this eigenvalue has dimension 1 


(4-owel)v= [5 5]>=0 => v=al4). acC. 


The matrix A can however be written as the sum of a diagonal matrix Ag and a 
particularly simple matrix A, 


= __ | ©3dB 0 01 
A=Aetac=|% onal + [00]: 
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Observe that the matrices Ay and A, do commute. For this reason we can use the 
following property of the exponential matrix 


el(AatAc) — ptAagtac | 


tAg 


Since Aq is diagonal, the first exponential matrix e’”“ is easily calculated to be 


ef Ad — et] 
A 


The second exponential matrix e'“¢ is easily calculated from the series defining the 
exponential matrix by noting that the square of the matrix A, vanishes 


efe = 7 +1A,. 


Putting these results together we obtain 


10 Ot lt 
tA __ ,@zapt — p%®3dBİ 
ese (oiloa) =e" Loa]: 


The above method can be used to calculate the exponential of any elementary 
Jordan matrix with the only modification that for an n x n matrix A it is the nth 
power of the matrix A, that vanishes. 


In the following we are always going to assume that the systems under consider- 
ation are controllable and observable. 
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Chapter 9 A) 
Weakly Nonlinear Time Invariant get 
Systems 


9.1 Introduction 


As outlined in Chap. 1, the behavior of nonlinear systems is substantially richer than 
the one of linear systems. To deal with them there is a set of techniques, each one best 
suited to analyse particular aspects or particular classes of nonlinear systems. We 
target systems that are stable about an equilibrium point and that depend continuously 
on the input signal. 

Before analysing in more details this class of systems, we give a short overview, 
mostly by way of examples, of systems described by nonlinear ordinary differential 
equations of the form 


Dy = f(t,y), f:I1xX—> R” (9.1a) 
with J C R, X c R” and initial conditions 
yO) =yeX. (9.1b) 


We limit ourselves to the aspects that are helpful in better framing the concept of 
weakly nonlinear systems. 

A first important difference compared to systems described by linear differential 
equations with constant coefficients is the fact that a solution may not exist for all 
t > 0 or may not be unique. 


Example 9.1: IVP with many Solutions 


Consider the following initial value problem (IVP) 


Dy=yVlyl y0) = y. 
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If yo > O then the equation can be solved by the method of separation of the variables, 
and we obtain the unique solution 


y(t) = EEN t>0. 


If yo = 0 then y(t) = 0 is a solution. However, it is not the only one. For any 
constant c > 0 the function 


yet) = 140-0) 


A t=, t>0 


is also a solution as one easily verifies by inserting it in the equation. 


For yo < 0 we can again use the method of the separation of the variables to find 
the solution i 
yO = -7V l- 9". 


However, due to the fact that at y = O the function 1/,/|y| is not continuous (not 


even defined) this solution is only valid as long as y(t) < 0. When y(t) reaches zero 
the equation can again be satisfied by multiple solutions 


—42VJTyol — D? t € [0, 2/170) 


Ye(t) = 40 t € [2,/lyol, ©) 
i-o)? t € [c, 00). 


Therefore, for some initial conditions the equation has uncountably many solutions 
(Fig. 9.1). 


Fig. 9.1 Two solutions of 
the initial value problem of 
Example 9.1 with yo = —0.5 
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From the above example we see that continuity of f is not enough to guarantee 
the existence of a unique solution of the initial value problem (9.1a). To guarantee 
uniqueness of a solution the function f(t, y) must be more regular with respect to y. 

Let Z C Rand X C R”. A function f € C(I x X, R”) is called locally Lipschitz 
continuous in x if every point (fo, yo) € Z x X has a neighborhood U x V such that, 
for some constant M > 0 


IFE y) -= fE Mly- xl, teu, x,yeV. 


If the function f(t, y) in (9.1b) is continuous in ¢ and locally Lipschitz continuous 
in y then Picard-Lindeléf’s theorem guarantees the existence and uniqueness of the 
solution of the initial value problem (9. 1a) [23]. 

If the function f doesn’t depend explicitly on time, then the system is time invari- 
ant and the system equation becomes 


Dy =f), f:X>R", XCR”. (9.2) 


A solution of the equation for which Dy = 0 is called an equilibrium point of the 
system. When one investigates the stability of an equilibrium point y, one can always 
assume it to be at the origin. In fact, by the change of variable u = y — y, one can 
always transform the system differential equation in one whose equilibrium point of 
interest is ue = 0 

Du = Du + ye) = f(u + ye) =: gw). 


An equilibrium point is stable if for each c > 0 one can find an € > 0 such that 
Ilse. = > |ly@\l<c, tto. 
It is asymptotically stable if it is stable and in addition € can be chosen such that 
Dæ@l<e = lim [ly =0. 


The set of all points y (to) such that || y(¢) || converges to zero as f tends to infinity is 
called the domain of attraction of the equilibrium point. If an equilibrium point is 
not stable it is called unstable. 

As already highlighted in Chap. 1, an important difference of time invariant non- 
linear systems compared to LTI ones is the possibility of the existence of multiple 
isolated equilibrium points. 


Example 9.2 


Consider the system described by the following differential equation 


Dy = —ay + cy’ 
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with a and c positive constants. From 

0 = -ay + cy” = cy(y — a/c) 
we see that the system has two equilibrium points: 

yt)=0 and = y(t)=a/c. 


We are interested in the dynamic of the system starting from the initial condition 
y(0) = yo assuming that yo doesn’t coincide with an equilibrium point. Since the 
function f(y) = —ay + cy? is locally Lipschitz continuous, there is a unique solu- 
tion and this solution doesn’t intersect the equilibrium points. The initial value prob- 
lem can therefore be solved by separating the variables and integrating 


y t 


|fe 
cy(y — a/c) J 


yo 


The solution is found to be 
—at 


= yg (l = e7) | 


y(t) = Yor 


If yo is negative or O < yoc/a < 1 the solution converges toward zero which therefore 
is an asymptotically stable equilibrium point (see Fig. 9.2). If yoc/a > 1 the solution 
diverges and reaches infinity in the finite time 


From the above example we see that a nonlinear system can have multiple equi- 
librium points some of which can be stable, and some unstable. For a system to 
remain stable around a stable equilibrium point the initial condition may have to 
remain within a limited region around that point. Also, divergence from initial con- 
ditions near unstable equilibrium points can diverge faster than exponentially and 
reach infinity in finite time (finite escape time). 

One of the most useful tools in the study of the stability of equilibrium points is the 
Lyapunov stability theory [24]. In particular Lyapunov’s linearization (or indirect) 
method, states that 


e If the linear approximation of the system about an equilibrium point is asymptot- 
ically stable then, in a neighborhood U of the equilibrium point, the (nonlinear) 
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Fig. 9.2 Solutions for 
various initial conditions of 
the initial value problem of 
Example 9.2 with a = 1 and 
c=1/2 


system is asymptotically stable. The largest neighborhood U is the domain of 
attraction of the equilibrium point. 

e If the linear approximation of the system about an equilibrium point is unstable, 
then the (nonlinear) system is unstable. 


If the linear approximation of the system is neither asymptotically stable nor unstable 
then this method is inconclusive and one must turn to other methods, for example, 
Lyapunov’s direct method [24]. 


Example 9.3 
Consider the initial value problem described by the differential equation 

Dy = cy? 
with c a constant; and the initial condition 

y0) = yo. 
The only equilibrium point of the equation is the zero solution ye(t) = 0. As it’s 
immediately seen, the linearized equation is stable, but not asymptotically stable 
about the equilibrium point. 


The nonlinear equation can be solved by the method of the separation of the 
variables 


Performing the integrations and solving for y we find 
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Fig. 9.3 Solutions for 3 
various initial conditions of 
the initial value problem of 
Example 9.3 with c = —1 


0 2 4 6 8 10 


Yo 
4 1— 2cyêt 


If c > 0 the solution diverges and reaches infinity at 


yt) = 


_ 1 
~ Beye” 


loo 


Ifc < 0 the equation is asymptotically stable for any value of the initial value yo (see 
Fig. 9.3). 


Differently from what the above examples may suggest, most nonlinear differen- 
tial equations can’t be solved analytically. Therefore we are interested in methods 
to find approximate solutions around asymptotically stable equilibrium points in the 
spirit of a perturbation theory. Weakly nonlinear systems are a class of systems for 
which such a method exists and the solution is obtained in the form of a functional 
series. 

Informally weakly nonlinear systems can be described as systems operated around 
an asymptotically stable equilibrium point and whose response depends continuously 
on the input signal x. They include systems described by a differential equation of 


the form 
Dy = Cx + f(y), 


f:Y>R’, C:X> R", XCR, YCR” 


with C a linear function and f a function that, within the excursion range of interest 
of y, can be approximated to any desired accuracy by a Taylor expansion. Note 
that polynomials are locally Lipschitz continuous. For this reason weakly nonlinear 
systems are well-behaved and produce a well-defined and unique output response. 
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9.2 Graded Algebra of Test Functions 


In the previous section we illustrated some aspects of weakly nonlinear systems 
based on examples of systems described by nonlinear differential equations. We now 
look for a description based on distributions. We’ll see that this allows reducing the 
problem of solving some classes of nonlinear differential equations to an essentially 
algebraic problem. However, before discussing systems, we need some preparation 
that we provide in this and the next section. 

Let Vg, k € N be vector spaces on C such that Vg N V; = {0} fork A j. The direct 


SUM bs 
V := p V; := p V, (9.3) 


is the vector space whose elements are the sequences (xx) in Uio Vg with x, € Vk 
and x, = 0 for fast every k. That is, the set of all finite sequences with x, € Vg. The 
vector space structure of V is defined by the following addition and multiplication 
with scalars 


(i) Hck) = Oe + ey), Oe), OD EV, ceCc. (9.4) 


Each Vx is evidently a sub-vector space of V. 
If furthermore V is provided with a multiplication 


V xV >V, (x,y) > xOy 
such that it forms an algebra and in addition 
Vi O Vj C Vejo k,jeN 


then it is called a graded algebra. 
Let Vi = D(R*) be the vector space of test functions on R* with Vy = C. Then 


De = A De) 


k>0 
with the tensor product as multiplication 
PB Y(T... Tk, Tkl <- -o Tet) = O (T1, «++, TK) Y (Tk+1, <--> Teh) 


is a graded algebra that we call the graded algebra of test functions. We write elements 
of Dg as sums with indices denoting the grade of the element 
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N 
p=) pj, o)€ DR), NEN. 
j=0 


In the graded algebra of test functions we define the following convergence cri- 
terion. A sequence (m), m E De with 


Nm 


Pm = X` dim , Pj,m E€ D(R/) 
j=0 
converges to zero if 
1. There exist compact sets K; C R/, jJ=1,...,N with N = MaxXmen(Nm) such 


that for each j and m 
supp($j,m) C Kj. 


2. For every j > 0 and every j-tuple k € N/ the sequence (D‘o j,m)meN converges 
uniformly to zero. For j = 0 the sequence of numbers (¢0,m)men converges to 
zero. 


9.3 Direct Product of Distributions 


The direct product V of vector spaces V; on C is the vector space whose elements 
are the sequences (x+) with x, € Vg, k € N. The vector space structure is defined as 
for the direct sum by (9.4). It is denoted by 


V := I] V, := I] Vg. (9.5) 


k>0 k=0 


The key difference from the direct sum is that, in a direct product, the sequence does 
not have to be finite. 
Let V; = D’(R‘), with Vo = C. Then the direct product 


D, :=[ [205 


k20 


is the set of linear continuous functionals on Dg defined by 


(oe) 


h:De>C, ¢ (hg) := X > (hj, o) (9.6) 


j=0 


with 
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Since ¢ only has a finite number of terms different from zero, (h, @) is well-defined. 
As fork 4 j, D' (R*) A D'(R/) = {0}, here and in the following we denote elements 
of D, by sums in a similar way as we do for elements of Dg. 

Continuity in D% is defined by the convergence that we defined for Dg and follows 
from the continuity of distributions. Since Dz is a vector space, it’s enough to verify 
continuity at the origin. Let h € D4 and @ € Dg, then there exists an N € N such 


that 
N 


(a, p< YOANI sup hj, gj) 


Fo j€(0,...,N} 
and according to our definition of convergence, when ¢ converges to zero, so does 
sup ;|(1;, 6;)| and hence (h, o). 

In Sect. 3.1 we have introduced the tensor product of distributions and have seen 
that it is well defined between any pair of distributions. With it we can define a 
product g - h between elements g and h of Dg. It’s kth component is defined by 


k 
(gh) = (8-A => gi Ohje kEN (9.7) 
j=0 


with g; and h; the jth components of g and h respectively. With this product 
(Di, +, -) becomes an algebra. As is common practice, we will often denote g - h 
simply by gh. Being based on an associative operation (the tensor product) the prod- 
uct that we just defined is associative. 

Note the close similarity between the algebra of formal power series and the one 
that we have defined for D4. In both cases addition is defined component wise and 
the product has the form of a convolution. 


9.4 Symmetric Distributions 


Let Sẹ denote the set of all permutations of {1,...,k}. A distribution hg € D' (R*) 
is symmetric if 


(hk, (T00), ++ +5 To(ky)) = (he, O(T1, «+ +5 TR)) (9.8) 


for all permutations o € S, and every @ € D(R*). Symmetric distributions are fully 
characterized by symmetric test functions for 
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1 
(he, @(T1, +--+, Tk)) = his yy ETT) 


o ESk 


and the sum of test functions on the right-hand side is a symmetric test function. The 
sum of symmetric distributions is a symmetric distribution. Therefore, they form a 
vector subspace of distributions that we denote by Dym (RÝ). Similarly, we denote 
the vector subspace of all symmetric test functions on R* by Dsym (RÝ), the one of 
the direct sum of symmetric test functions by De, sym (IR*) and the one of the direct 
product of symmetric distributions by Dsm (R5). 

A symmetric distribution can be constructed from an arbitrary distribution f € 


D' (RÝ) by averaging over all permutations of the independent variables 
1 
Llym = 5 Dy fos «+++ To) (9.9) 
o ESk 


with 
(f (Toys <- -s Tok), (T1, +++ TI) = (f (T1, <- Tk), (T01); ++ +s To())) - 


Such an operation is called symmetrisation. 

The tensor product is a bi-linear operation. Therefore, the power of an element 
of Dz composed by a finite number of distributions f; € D'(R™), nj > 1, j = 
1,...,m,m > 2 can be expressed as a sum of tensor products 


m 


ii 
j=l 


m m 


yew fa- @ fj, KEN 


k 


j=l jk=1 
with the sum ranging over all possible combinations of the indexes j},..., jx. If the 
distributions fi, ..., fm are symmetric then one can reorder the indexes j,,..., jx 


by any permutation o without changing the value of the sum. Hence, the tensor 
products on the right-hand side can be replaced by symmetrized products 


DYD Ff [a88 ficlsym - 
j=l Jal j=l jr=l 


The tensor product of symmetric distributions inside the symmetrisation operator act 
as a commutative operator. For this reason the sum includes summands that are equal 
and, by grouping them, we obtain an expression that is similar to the multinomial 
formula [21] 


9.5 Weakly Nonlinear Systems 143 


k 
k! 


eA) = allm f= fee Sa) (9.10) 
j=l lal=k 
with a an m-tuple in N”, 
TE = a Q ve Q Jo% (9.11) 


and where we made use of the multi-index notation introduced in Sect. 4.6. 

In general the product that we defined on Dg applied to two elements of D 
does not result in an element of Dg sym: 
product 


/ 
®,sym 
This can be remedied by symmetrizing the 


k 
(gh) = (¢- We = D [8) Oele Ehe Dogm (9.12) 
j=0 


/ 


Unless explicitly stated otherwise, when working in Dg sym 


the use of this symmetrized product. 

The last property of symmetric distributions that we want to mention is the fact 
that, in a convolution algebra, the inverse of a symmetric distribution is symmetric, 
for 


we will always assume 


S(t, T2) = f(t, T2) * f* | (t4, T2) 
= f(m, 1) * f* | (T1, T2) 
= f(t, tm) * f* '(m, 11). 


9.5 Weakly Nonlinear Systems 


We are looking for a representation, in the spirit of a perturbation theory, of a class of 
nonlinear systems including the ones described by differential equations of the form 


K 
Ly=x+)oexy* (9.13) 
k=2 


with x € D’(R) a given input signal, L a linear differential operator with constant 
coefficients 
L = D” + ap- D”! 45-4 qD+a 


and where we assume that the linearized system is stable. 
In Chap. 7 we saw that, in the language of distributions, a linear differential 
equation with constant coefficients becomes a convolution equation. If we want to 
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apply the results obtained for convolution equations, we need to give a meaning to 
the nonlinear terms appearing in the above equation. 

In general, it’s not possible to define a multiplication valid for arbitrary distribu- 
tions. Therefore, the terms y*, k > 1 can’t be assumed to belong to D’(R). To work 
around this problem we can assume y to belong to a direct product of distributions, 
y = (yo, 1, Y2,---), and use the product defined on that space. Since the product 
between functions with values in C is commutative f - g = g - f, we require y to 
belong to the direct product of symmetric distributions De ons Then, if yı is the 
solution of the linearized equation its powers become tensor powers 


(yi) = y. 


If yı is a regular distribution, that is a locally integrable function, then we can recover 
the meaning of the powers in the differential equation by evaluating ye on the 
diagonal 

yM, ..., t) = ye). 


The same remains true if we replace yı by a sum of distributions. 

To complete the interpretation of the differential equation in the language of 
distributions it remains to be clarified what is the effect of the one dimensional 
differential operator D appearing in (9.13) on the components yg € I) ii (RÝ) of y. 
To this end, suppose y% to be a regular distribution. Then it is a locally integrable 
function 


Yk | T YklTi, -3 Tk), t e RÝ 


and we can associate with it a function of the single variable ¢ by defining an operation 
that we call “evaluating on the diagonal” 


Va.) := th yl(t,...,0), teR. 


If we assume this function to be differentiable, then the derivative with respect to t 
is well-defined 


Deva (yk) (t) = Diyg(t,...,0) +--+ + Deyelt, ..., t) 
and, as a distribution, can be represented by 
k-1 
Dyk := os 52) @ DS Q p=) * Yk. (9.14) 
j=0 


This last expression is symmetric and is valid for arbitrary distributions. Therefore, we 
can take it as the definition of the effect of the differential operator D on distributions 


yk E Dym R”). For y € Dg sym and any ġ € De, (y, $) only has a finite number 
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of terms different from zero. For this reason the effect of D on y can be defined as 
acting on each component individually. 

For y € Dosya to be a solution of (9.13) in a convolution algebra, the equation 
must be satisfied by each component y,; of y individually. If y has to be compatible 
with our assumption of the system being described around the zero equilibrium point, 


then the Oth component yo must always be zero 
yo =0. 


In analogy with the theory of formal power series we call distributions y € D with 
yo = 0 nonunits [25]. 
For k = 1 the only terms belonging to D' (R) appearing in the equation are y; and 
x. Hence, yı is the solution of the linearized equation and, as discussed in Sect. 8.1, 
can be represented by 
y= hy xX. 


For k = 2 we have 
Lô * y= c2 ye? 8 € D'R?) 


and we see that, for the computation of y2, the tensor power of yı plays the role of an 
input signal applied to a linear system. Assuming that Lô has an inverse, we obtain 


y2 = c2 (LE)! ey 


The above expression can be further manipulated by noting that 


((a(T1) ® b(t2)) * (f (t1) 8 g(T2)) , $ (T1, T2)) 
= ((a(T1) 8 bm)) 8 (F(A1) 8 802)), O( + A1, T2 + A2)) 
= ((a(t1) * f (T1)) ® (b(t) * 8(T2)) , $ (T1, T2)) 


or 
(a@b)*(f@g)=(ax f) (beg). (9.15) 


With this expression and the solution found for y; we can express yz as 
yo = hy * xe, hy = cy (L)! x h? 


where raising to a tensor power is assumed to have higher priority than convolution. 

From this it is not difficult to see that every component y+ can be expressed as the 
convolution of a distribution hg specific to the problem and the input signal x raised 
to the tensor power of k 


yk = hye x, 
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Fig. 9.4 Block diagram i i 
representation of a ' : 
time-invariant weakly oc 
nonlinear system. If the input t— hk };oeo 
signal is proportional to the 
constant c € C, the output of 1 l 


the block characterized by œ c? 

the kth order impulse m h > 
response h is proportional 

tock 2 


We are therefore led to define a weakly nonlinear (or analytic) time-invariant (WNTI) 
system as a system H whose behavior around the zero equilibrium point can be 
described by an element h of D syi such that, when driven by the input signal x, 
its output is given by 


oe) 
y = h[x] = Y hax, heH, xeA, (9.16) 
k=1 


with A, a convolution algebra in D' (R) and A, a convolution algebra in Dym R5) 
compatible with A, and the tensor product. This means that, if x € A|, then x®* 
must be an element of A;,. We denote such a set of convolution algebras by Ag sym: 
The distribution h, is called the kth order impulse response (or kernel) of the system. 
A block diagram representation of a weakly nonlinear system is shown in Fig. 9.4. 
Note that, if the input signal is multiplied by a constant c € C, yx is scaled by a factor 
of ck 
Ve = hg * (c x)® = (h; * x) . 

The interpretation of the output of our definition of a weakly nonlinear system 
requires some comment as it doesn’t always represent a quantity that can be inter- 
preted as a signal depending on time. Under the assumption that all involved dis- 
tributions belong to a convolution algebra, then one can distinguish the following 


Cases 


e If the impulse responses hg as well as the input signal x are regular distributions 
and the convolutions hg * x®* are well-defined (see Sect. 3.2) then all output 
components yx are locally integrable functions. In this case we can evaluate the y 
on the diagonal 
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eva (yk) (t) = eva (hg * x®™)(t) = 


and obtain an interpretation for the yọ as signals of time. 
If the input signal is scaled by the constant c, then, at each time t, the output 
evqa(y)(t) is seen to be a power series in c 


eva(y)(t) = X cheva hy * x) (1). 
k=1 


If this series has a convergence radius greater than zero valid at all times, then 
eva(y) represents a well-defined function of time and we have a clear procedure 
to interpret the output of the system. 

If some or all of the impulse responses hg are not regular, there is still a class 
of input signals for which all yọ are regular distributions. (Remember that the 
convolution of any distribution with a test function is an indefinitely differentiable 
function.) The system restricted to this class of input signals may still be evaluated 
on the diagonal to obtain a function of time evg(y) as in the previous case. 

If for no input signal (different from zero) there is a constant c > 0 such that 
eVqa(y)(t) remains finite at all times then the system can’t be represented using an 


/ 
element of Dg sym: 


Example 9.4: Polynomial System 


In this example we consider a class of systems whose impulse responses are not 
regular. 

Suppose that the output of a system H is represented by a nonlinear function f 
of the input signal x and that the function f can be adequately approximated by a 
Taylor polynomial around the origin 


SOO 4 
X 
k 


, 
! 


K 
y=f@r>d FO) 20e- K>1. (9.18) 
k=1 


It is readily seen that such a system can be represented by the impulse responses 


k=1,...,K. 


k= LO 5k 
, 


k 


The response of the system to the input signal x as represented by these impulse 
responses is 
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K fD 
y = Ale] = J Lae at, 


k=1 


If the input signal is not a regular distribution, for example if it is a Dirac pulse, then 
neither the initial representation given by (9.18), nor the evaluation on the diagonal 
evq(h[6]) do have a meaning. In spite of this, the impulse responses and their outputs 
yz are mathematically well-defined. 

If the class of input signals is restricted to regular distributions then the output 
obtained from the representation in terms of impulse responses by evaluating on the 
diagonal evg(h[x]) agrees with the original one. 

If f is analytic, then it can be represented by a power series (K — oo). In this 
case the output of the system is only well defined if the magnitude of the input signal 
|x(t)| remains smaller than the convergence radius of the series at all times. 


Let h; be the kth order impulse responses of the weakly nonlinear system H and x 
its input signal. In Sect. 3.3 we saw that an arbitrary distribution can be approximated 
to any desired accuracy by a finite sum of Dirac pulses. Hence, x can be approximated 
by 


M 
xaJ > ajd(t—Aj), aj €C, A; eR 
j=l 


and the output of hy by 


M M 

Yk x ie haraj — Ài) @aj5 lT =À) 
j=l 
M 
j=l 


j=1 


M 
X ape ay helt m Te A JE) 
Jk=1 


This expression suggests the interpretation for Ag as that portion of the system defin- 
ing how the response of the system depends on the combination of k simultaneous 
points in time of the input signal. 
In addition, if we compare the expression representing the output at time ¢ of the 
(causal) impulse response hg 
eva (Ax * x) (1) 


with the one of a polynomial system (see Example 9.4) 
eva (c,d * x?) (1) = cp x*(1) 


we see that, the output at time ¢ of the latter only depends on the kth power of the 
current value of the input signal. In contrast to this, the output at time t of the former 


9.6 Nonlinear Transfer Functions 149 


depends on all combinations of products of k (past) values of the input signal. The 
impulse responses h+ can thus be interpreted as the memory of the system. The given 
representation of weakly nonlinear systems can be seen as a generalization of the 
Taylor approximation method for memory-less systems to systems with memory. 
It is called the Volterra functional series in honor of V. Volterra who first proposed 
it [5]. 


9.6 Nonlinear Transfer Functions 


All impulse responses hg of a causal weakly nonlinear system must vanish if any 
argument T; is less than zero. This is most easily seen if we consider the case where 
the impulse responses as well as the input signal x are regular distributions, for then 


eva (yk) (t) = 


[oe] (oe) 


Jo en eee OEE EL 


As every distribution is the limit of smooth functions, this must then be true for 
arbitrary distributions. The impulse responses of all orders of causal systems are 
therefore right-sided distributions. 

The Laplace transform of the k-order impulse response h, is called the nonlinear 
transfer function of order k 


Ay (51, ..-, Se) = (het, «--5 TH), © EY (9.19) 
Due to the symmetry of hg, it is a symmetric function of the variables sı to sg 
Ay (51, Sk) = Ak(Soq), <- -3 S0), =F E Sk. (9.20) 


As the Laplace transform converts convolution products into ordinary products, the 
Laplace transform of yg = hx * x is 


Yy(s1,..., s) = Ag(sy,..., 5, )X (81) +--+ + X (sx). 


Just as with LTI systems, the many useful properties of the Laplace transform 
makes it a very valuable tool for solving convolution equations describing weakly 
nonlinear systems. In particular, on top of converting convolution products into ordi- 
nary multiplications, in their region of convergence, the Laplace transformed of 
distributions are holomorphic functions. 

Consider a system described by a differential equation with constant coefficients 
of the type considered before 
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Ly = Nx+ Djn¢jy 
L = D” + an1 D”! +--+ +a (9.21) 
N = bp D” + bm-1 D”! +- -- + bo. 


The part of the corresponding convolution equation relevant for the calculation of 


Yk, k > 1, is 
k 


(L8**) ye = Yo cj ++ + ye). 
j=2 
As the Laplace transform of Dé®* is 
LIDESHsi, a., Be) = ape eo 
(see (9.14)), the Laplace transform of Lé®* isa polynomial in sı + ---+ sx 


Psi +++ +54) = (81 +++ + 54)" + anali + +++ + 5K)" Ho +a. 


Note that the coefficients of this polynomial are the same for all k, including k = 1. 
The only difference between the various values of k is in the argument. If we factor 
it, we see that the denominator of H, adds to the denominator of the lower order 
transfer functions H;, j = 1,...,k — 1 terms of the form 


Giese Bi) 


with p; the jth pole and /; its multiplicity. If we assume H; to be a proper rational 
function, then its partial fraction expansion will include terms of the form 


F(s,..-, Sk—1) 
(iy rene H sk> pj)” 


and similar ones where some of the variables s1, ..., 5;,_; may be missing. If by the 
calculation of the inverse Laplace transform we start by inverse transforming with 
respect to s we obtain the expression 


D-i ipie 
F(s1, a, S1) T EPT Ootd 1, (r). 


By using the shifting property of the Laplace transform and denoting by f the inverse 
transform of F, the complete inverse transform of the above expression is 


1 
ePi™ 14 (T4). 


b= 
f(t — Tk, +. Tk-1 — Tk) D T% 
If Hx is not a proper rational function, then it can be decomposed into a polynomial 
and a proper rational function. The inverse Laplace transform of the polynomial part 
results in Dirac pulses and its derivatives. 
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This shows that, if the system under consideration can be described by a differ- 
ential equation with constant coefficients of the indicated type, then, similarly to the 
first order impulse response h, the higher order impulse responses are sums of Dirac 
pulses, their derivatives and products of polynomials and exponential functions in 
the variables t1, ..., Ttg. In addition, it also shows that, if the linear transfer function 
H; (sı) has all its poles in the left-hand side of the complex plane, then not only does 
the regular part of h, (that is discarding the Dirac pulses and its derivatives) decay 
exponentially as its argument tends to infinity, but so also do the regular part of all 
higher order impulse responses hg. In particular, we see that all impulse responses 
are summable distributions 


hke Dp R),  k=1,2,.... 


In the following, unless explicitly stated otherwise, we are always going to assume 
the systems to be of this type. 


Example 9.5 


We revisit Example 9.2 and find an approximate solution of the initial value problem 
Dy = -ay + cy’, yO) =y, a,c>0, 


valid around its zero equilibrium point. 
As we saw, in translating an initial value problem into the language of distributions, 
the initial conditions become part of the equation which, in this case, comes to be 


(D +a)y = yoô + cy’. 


We can think of this equation as an equation describing a system driven by the input 
signal x = yoô. The solution of the equation y is an element of D and has the 


®,sym 
form 
oe) 
y= > hy x x”. 
k=1 


The system is therefore fully characterized if we find the impulse responses hg. The 
solution of the original problem is then found by multiplying each impulse response 
hy by yp 

Ye = heyk- 


To find the impulse responses we apply the input signal x = 6 and insert y = h 
into the equation. The equation is solved if it is satisfied by each component h, of h 
individually. The component h; can be computed from the equation and the impulse 
responses of lower order hj, j =1,...,k —1. 
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To find h; we retain only terms of the equation belonging to D' (R) 
(Db + a5) *hy =ô. 
If we Laplace transform the equation we obtain 
(sı +a)Hı(sı) = 1 


from which we immediately obtain the first order transfer function 


Hı (sı) = 


Sj ta 
and, by inverse Laplace transformation, the first order impulse response 


y(t) = 140) e". 


The second order impulse response h3 is found by retaining in the equation only 
terms belonging to D' (R?) 


(D + a)5® x hy = ch’. 
From the Laplace transformed equation 
(sı + s2 + a) (s1, 82) = c Hı (s1) Hı (s2) 
we immediately obtain the second order nonlinear transfer function 


c Hı (sı) Hı (s2) 


H(s1, s2) = PA E 
1 +s2 


Note that it’s often convenient to write higher-order transfer functions in terms of 
the first-order one. In this example 


Hy (s}, 82) = c Hi (sı + s2) Hı (s1) Hı (s2) . 


To obtain the second order impulse response we can inverse Laplace transform, 
first with respect to one Laplace variable, then with respect to the other one, and 
finally by symmetrizing the result. We first inverse transform with respect to sz the 
expression 

1 


Ay (sı + 82) My (s2) = [s2 + alee + a) ` 


Assuming sı 4 0! and expanding in partial fractions we find 


' The obtained expression is a continuous function of sı which we extend by continuity to sı = 0. 
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- (1 —e™! 2) e 4? 1, (t2). 


We then combine this expression with the other factors of H2 


Cc 


——— (l-e"")e“%? 1, (T 
(sı +a)sı ( ) HD 


and inverse transform with respect to sı. This can be done by expanding in partial 
fractions the first factor 


oa (cet a ae ea 


and by using the shifting property of the Laplace transform to find 


< [0 - e7) 140) — (1 e) 140 — n) 08” 140). 
Note that this expression is not symmetric and that if we had first inverse transformed 
with respect to sı and then to s2, we would have obtained an expression with tı and 
T2 exchanged. 
The second-order impulse response is obtained from the above expression by 
symmetrisation 


ho(t%, T2) = | [(1 e ti) (1 ete) 1i(y _ 7) | a 


sym 


where we have suppressed the explicit Heavyside step functions with the understand- 
ing that the expression is zero if t} < 0 or t2 < 0. As h3 is a regular distribution, it 
can be evaluated on the diagonal and we obtain 


eva(ha)(t) = - (e e). 


The third order impulse response h3 is found by retaining only elements belonging 
to D' (R3) in the equation. As a first step we write 


(D + a)8®? x h3 = c (hı + h2)? 


for no other term can produce distributions belonging to D' (R3). The right hand side 
can be expanded with the help of (9.10) and, retaining only the terms of interest, we 
obtain 

(D + a)ô® x hz = 2c [hy @ haem - 
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The Laplace transformed equation is 

(s1 + 52 + 53 + a) H3(s1, 52, 53) = 2c [H1 (51) Ao (52, $3) sym 
and with it the third order nonlinear transfer function is readily obtained 

A3(51, $2, 83) = 2c Ay (sı + 82 + 53) [H1 (81) Ao (52, $3) loym - 


By expressing H in terms of H, we can write H3 in terms of H, alone 


2 
3 (s1, 82, 83) = xc Ay (sı + s2 + $3) (s1) Hı (s2) Hı (s3) 


- [M1 (s1 + s2) + Hı (s1 + 53) + Hı (s2 + s$3)] - 
The computation of the third order impulse response proceeds along the same lines as 
the computation of h2. After some algebraic manipulations and exploiting the prop- 


erties of the Laplace transform we obtain a rather long expression whose evaluation 
on the diagonal is 


eva) = (E) (ee — 2e +) 


At this point it’s interesting to compare the first three elements of the approxi- 
mate solution that we computed here with the exact solution that we calculated in 
Example 9.2 and that we reproduce here for convenience 


—at 


1- ys = e7) | 


y(t) = yo 


If |yoc/a| < 1 the exact solution can be expanded in a geometric power series 


= eva (h1 yo + hays + hayg)(t) +- 


and see that the lowest order terms correspond to the calculated response components 
yı, y2 and y3. Note also that the convergence radius of the power series derived from 
the exact solution corresponds to the radius of the largest open ball, centered at the 
origin and contained in the domain of attraction of the equilibrium point 


9.6 Nonlinear Transfer Functions 


Fig. 9.5 Comparison of the 
approximate solutions of the 
logistic differential equation 
in terms of y1, y2 and y3 
against the exact solution 
y(t) for 
a=1,c=1/2,y=1 


(0, 
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ajc) = [|w ER y <<}. 


Figure 9.5 compares the exact solution of the initial value problem with the approx- 


imation given by eva(yı + y2 


+ y3) fora = 1, c = 1/2, yo = 1. 


While for this particular example it was easier to compute the exact solution than 
to calculate the approximation, the latter allows us to obtain the output of the system 
described by the differential equation 


Dy +ay = x + cy? (9.22) 


for any input signals x € D'_ (R) maintaining the system withing the region of attrac- 


tion of the equilibrium point 


eva(y)(t) © eva(yı + y2 + y3)(t) 


with 


t 


yO = f ie = xan 


0 


MOL J / int — t,t — Walepadades 
0 0 


eva(y3)(t) = J / J h3(t — t1, t — tT, t — 13)x (tT) x (T2)x (13) dt dtd: . 


0 0 0 
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Fig. 9.6 Comparison of the 
approximate solutions of 
(9.22) with a sinusoidal input 
x(t) = 14 (t) sin(t) in terms 
of the three lowest order 
response components y1, y2 
and y3 against the solution 
y(t) obtained by numerical 
integration for 
a=1,c=1/2 


Here and in many problems, this amounts to limiting the magnitude of the input 
signal to sufficiently small values. Figure9.6 show the approximate solution for a 
sinusoidal input x(t) = 14 (t) sin(t) and compares it to the solution obtained by 
numerical integration of the differential equation for a = 1, c = 1/2. 


This example shows how by representing the solution of a nonlinear differential 
equation describing a weakly nonlinear system by a sequence of distributions y € 
Deca we have reduced the problem of solving a nonlinear differential equation to 
an essentially algebraic problem. While some expressions are rather long, they can 


be manipulated rather easily by modern computer algebra systems (CAS). 


Example 9.6 


We revisit Example 9.3 and try to find an approximate solution in Do aym of the 
initial value problem 


Dy =cy’, y(0) = yo, c <0 


valid around its zero equilibrium point. Note that the linearized equation is stable, 
but not asymptotically stable. 

As before we calculate the impulse responses by setting yọ = 1. The solution for 
an arbitrary yo is then found by multiplying the kth order impulse response hg by yk. 

The first order impulse response h; is found by writing the convolution equation 
corresponding to the above initial value problem and retaining only terms of first 
order 

Dô xh =ô. 
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By Laplace transforming the equation, the first order transfer function H; (s1) is 
found to be 


1 
Hı (sı) = — 
1 


From it, the first order impulse response is 
hy (t1) = 14(%1). 


The equation doesn’t have second order nonlinearities. Therefore the second order 
impulse response and the second order transfer function are both zero 


ho(%],T)=0, Hbo(s1,52)=0. 


The third order impulse response is found by retaining all third order terms in the 
convolution equation 


Dô x hz = ch. 
By Laplace transforming the equation we find for the third order transfer function 


c 
(s1 + 82 + 83)515253 


H3 (s1, 82, 53) = 


From this, the third order impulse response is obtained by inverse Laplace transform- 
ing with respect to one variable at a time and by symmetrizing the result 


h3(T1, T2, T3) =¢[ati(a) + (T2 — 13)14.(13 — T2) 


+14 = mD[ = a) = 11) + (i — 1)14 (13 — 2] 


sym 


From the above results we could conclude that, to third order, the approximate solu- 
tion of the initial value problem is 


eva(y)(t) = vole) + eyi 1 Ot +. 


This is however only valid for sufficiently small values of t. The reason is best seen by 
comparing the above expression with the exact solution of the initial value problem 
that we obtained in Example 9.3 and that we repeat here for convenience 


Yo 


4 1— 2cyêt 


The Taylor expansion around zero of the function 


yt) = 
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is 
eg Ms E T MST 
XxX XxX XxX eee 
a oe aaa Pr Or ae ee 


and has a convergence radius of 1. Therefore, as long as |2cyêt| < 1, the exact 
solution can be represented by the power series 


3 5 35 
y(t) = yo [i + cyot + 5 (vat) + 5 (eat) + z (00 +- | 


whose first two terms coincide with yoh (t) and eva( yghs)(t) respectively. However, 
as t increases, the higher order terms become more and more important and, when 
|2cyêt| = 1, the Taylor expansion stops being a valid representation of the exact 
solution of the initial value problem. 


The last example shows that, in general, the solution of a nonlinear differential 
equation in terms of an element of Dya is only meaningful around an equilibrium 
point for which the linearized equation is asymptotically stable. The reason being 
that, if this is not the case then the response of the system to any part of the input 
signal can persist indefinitely in time without ever decreasing to negligible levels. 
Since this is true for the response of any order, the output eva(y) can not in general 
be represented by a power series. We can say that systems that are representable by 
a Volterra series are those whose output does not depend on the too distant past. 

In the case in which the linearized system is asymptotically stable all impulse 
responses are summable distributions. Their Fourier transforms are therefore con- 
tinuous functions that can be obtained from the nonlinear transfer functions H, by 


hy (wr, -.., @k) = Helo,- JOR). 


As the nonlinear transfer functions are rational functions, the Fourier transforms hy 
are indefinitely differentiable and of polynomial growth, so they belong to Oy. 


9.7 Periodic Input Signals 


In this section we investigate the response of weakly nonlinear systems to periodic 
input signals. Given a periodic input signal x, every tensor power x®* is evidently 
also a (higher dimensional) periodic distribution. Therefore, every component yx 
of the system response y can be calculated in the convolution algebra of periodic 
distributions and represented by a Fourier series. 
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Let x be a J -periodic input signal with Fourier coefficients 
Cm (x) = Lae eI rt) meZ. 
m T § , 


Further, let m = (mı, ..., mg) € Z* be a multi-index and w, = 22/7, then the 
Fourier coefficients of the kth tensor power of x are 


= (xP, gyal.) 


1 


1 
= — (x, eo UM eT} agi a eo IMeMeTE 
= Cm, (X) ihe Cm (X) R 


With this expression and a straightforward generalization of Eqs. (4.21) and (4.24) 
to higher dimensional distributions, the Fourier coefficients of y are readily seen to 
be 


Cm (Ve) = Age, -< , MEW) Cm, (X) ++ Cm, (X) (9.23) 


with Í; the Fourier transform of the kth order impulse response of the system. 


9.8 Multi-tone Input Signals 


In some applications, for example in the study of interference and distortion in com- 
munication systems, one is often interested in the response of a system to input 
signals consisting of sinusoidal tones. If the frequencies of the tones are commensu- 
rate, that is, if their ratios are rational numbers, then one can find a common period 
and the input signal is periodic. The system response can thus be obtained by using 
the results of the previous section. However, for multi-tone input signals the results 
are often more directly interpretable by using a different indexing scheme for the 
tones composing the output components y [13]. 


9.8.1 General Case 


Let’s consider a system driven by an input consisting of N complex tones 


N 
x(t) = XO Anxn(t), Xn(t) = a, Ay i= |Anne?” 


n=1 
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initially assumed to have commensurate angular frequencies w1, . . . , @y. Our objec- 
tive is to calculate the system response of order k 


yk = hye xP, 


Consider first the tensor power x®*. It can be expanded with the help of (9.10) 


N @k 
yS: = (> sur) 


n=1 


k! 
® ® 
= 5 AT ANN 3 [xi m g a ee 
|mj=k ` 


with m the multi-index m = (mı, ..., my) whose elements range from 0 to N. 
Observe that this expression is the Fourier series representation of x®*. With it and 
(9.23) the Fourier series representation of yg is thus found to be 


k! mı my T @m, my 
w=} A +++ Ay hrm [xi 8 8 XN loy 
|m|=k Mm 
hk m = hy(@1,...,@1,.-.,@N,---,@n) (9.24) 
— —— el 
mı My 


with Á; the Fourier transform of the impulse response of order k. As this sum is finite 
and only composed by indefinitely differentiable functions, it is itself an indefinitely 
differentiable function that can be evaluated on the diagonal 


ye(t) = VODNO = D> yam) (9.25) 
|m|=k 
k! F 
Yem (t) = — AT! ANY hkm E (9.26) 
m. 
N 
Om = Som, on = mo, +++ + myo - (9.27) 


n=1 
The kth order response of the system is therefore a sum composed by 


(N —1+h)! 


(N — DIK! (2.28) 


complex tones, each one uniquely determined by a specific multi-index m. In this 
context the multi-index m is also called a frequency mix and |m] its order. 
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These results show several important properties of weakly nonlinear systems. 


e Incontrast to linear systems, weakly nonlinear systems generate tones at frequen- 
cies not present at its input. 

e In general, tones at a specific frequency are generated by frequency mixes of 
various orders. 

e To fully characterize h k (and hence hz) one needs k input tones. 


At the beginning of this section we assumed the input frequencies to be commen- 
surate. If this is not the case then the input signal is not periodic, but almost periodic. 
For such signals one can still define a Fourier series [16, Sect. VI.9] and the obtained 
results remains valid. 


9.8.2 Real Case 
In this section we specialize the above results to the case of a real system driven by 
an input consisting of N sinusoidal signals 


N 
x(t) = X |An| cos(@nt + Gn) 


n=1 


and where we assume @,,..., ®@y > 0. To re-use previous results it’s convenient to 
represent the input signal in terms of complex exponentials and use separate indexes 
for positive and negative angular frequencies 


N 
1 
t)= 7> AnXn(t A-nX-n(t), n(t) := ee 
x(t) a POA), 0) 
An = |A,|e/*", An = An = |Anle/™, W-n = =n. 
The quantity A, is called the phasor of the sinusoidal signal 


|An| cos(@nt + Gn). 


With this notation and using the multi-index m = (m_y,...,m_1,™m1,..., my) the 
output component yx is easily calculated with the help of (9.24)-(9.27) 


yet) = D> Yent) 


|m|=k 
— 1 k! A”-N Amam A” h [Omt 
Yesm(t) = aE e ee | | cee, k,m È 
him = hy(@_n,...,@_Nn,.-.,@N,.--, ON) 
— — 


m-y my 
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N 
On = a Mn @n = (My —M_1)o, +--+ + (my —mM_yn)on. 
n=—N 
n#0 
To N sinusoidal input tones there correspond 2N complex tones. Therefore, in the 
real case, the sum is composed by 


(QN—-1+k)! 


CN — 1)Ik! Ole 


frequency mixes. 

In the real case there is some extra structure that we can exploit. Consider a specific 
frequency mix m = (m_y,...,M_1,™m,..., my). From the above expression, it’s 
apparent that the multi-index 


rv(m) := (My, ..., M1, M—1,..., M-N) (9.30) 


obtained from m by reversing the order of the entries does also appear in the Fourier 
series of yg. If m A rv(m) then from k!/rv(m)! = k!/m!, @w(m) = —@m, Arvin) = 
Amand hurvim) = him we deduce that the sum of ykm (t) and Yk rvon) (t) is a sinusoidal 
signal 
Yim (t) := Yk,m(t) F Yk,rv(m) (t) = 2R Vem} 
1 k! 
~ Xim! 


: COS(Wpt + @m + Wim) 


Ait" ine |Ay | tm-x Akm| (9.31) 


with 
firm = lik m| 1" (9.32) 
N 
Ym = $ Mn Gn = (mı — M1) ++ + (my — MV) ON - (9.33) 
n=—N 
n#0 


If m = rv(m) then the multi-index rv(m) is not distinct from m and the Fourier 
series component described by rv(m) coincides with the one described by m. In this 
case @,, = 0 and, as the system is assumed to be real, him must be real. The response 
Yk,m therefore becomes 


1 k! 2m 2mn f 
Yim(t) = Jk mi Â! l-o [An] hkm, m= rv(m) (9.34) 


Note that m and rv (m) can only be equal for even values of k. Also, note that there 
can be multi-indexes m resulting in @,, = 0 for which m = rv(m) doesn’t hold. 
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Example 9.7 


Consider again the system described by the differential equation 
Dy +ay=x-+cy’ a,c >Q. 


that we analysed in Example 9.5. Here we are interested in the steady state response 
of the system when driven by the input signal 


x(t) = |A| sin(wıt) = |A| cos(wıt — 2/2). 


In our previous analysis of this system we calculated the first three nonlinear 
transfer functions H1, H) and H3. Using those results, the output components y1, y2 
and y3 are immediately obtained from (9.31) and (9.34) without having to calculate 
any inverse Laplace transform. 

Concretely, as the input signal consists of a single sinusoidal tone, the frequency 
mixes are composed by two entries m = (m_,, m1). The output of first order yı is 
obtained from the above equations by setting k = 1 and by summing over all multi- 
indexes satisfying the constraint |m| = m_, + mı = 1. There are only two such 
multi-indexes: (0, 1) and rv((0, 1)) = (1, 0). The first order output of the system is 
therefore given by 

yi(t) =R {HA (ya) Ae!" 


with A = |Ale~/7/?. 

The second order response of the system y2 is obtained by setting k = 2 and 
summing over all multi-indexes under the constraint |m| = 2. There are three of them: 
(2, 0), (0, 2) and (1, 1). The first one is the reverse of the second one. Therefore, the 
contribution of these two is obtained from (9.31) 


Li w 
Y50336) = zN {Uor J01)A e? | i 


Since the remaining multi-index is equal to its reverse (1, 1) = rv((1, 1)), its contri- 
bution is the constant given by (9.34) 


yaa) = [AŻ M (Jo, Jor). 
The response of second order is thus 
ya(t) = y2 0,9 ©) + y2,- 
The third order response of the system y; is obtained by setting k = 3 and summing 
over all multi-indexes for which |m| = 3. There are four of them: (3, 0), (2, 1), (1, 2) 


and (0, 3). Two of them are the reverse of the other two. For this reason the response 
of third order of the system y3 is given by 
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y(t) = Y3.0,3) H) + Y3,c1,2) O) 


with 
Cc = lo H A? p3ait 
Y3,(0,3) (t) = { 3(J@1, J@1, J) Ave } 


ê 3 2 Jo\t 
aaO = ZH {Ha(—Jor, Jor, Joa? Ae’) 


Example 9.8: Two Tones Input 


Suppose that we would like to implement a causal real LTI system. However, due 
to unavoidable limitations of physical components, the implementation behaves as 
a real weakly nonlinear system characterized by the nonlinear transfer functions Hg 
(see Fig. 9.4). We are interested in its output when driven by an input signal consisting 
in two sinusoidal tones 


x(t) = |A| cos(@t + p1) + |A2| cos(wat + p2). 


We think of the two tones as closely spaced in frequency and denote the difference 
of their angular frequencies by Aw = w2 — @). 

As the input is composed by two sinusoidal signals, the frequency mixes have 
four entries m = (m2, mM_,, M1, M2). From (9.29) we calculate that there are 4, 10 
and 20 frequency mixes of order one, two and three, respectively. They are listed in 
Table 9.1. 

The first order output yı is the output that would be produced by a perfectly 
linear system. All other tones are undesired. In particular, while tones relatively 
distant in frequency from w; and @ are relatively easily suppressed with filters, tones 
close to them are much more difficult to filter out. The tones closest in frequency 
to œwı and œ listed in Table9.1 are the tones associated with the frequency mixes 
(1, 0, 2, 0), (0, 1, 0, 2) end their reverses 


3 E 
Y5,(1,0,2,0) (t) = 7% [A A? H3 (— Janz, JO}, joe | 


and 


3 = 
35.0.1039 © = ZH {A1 43 BC jor, Jor, joe TOO") 


both produced by nonlinearities of third order. 

The frequency mixes of fifth order are 56. Among them we can easily identify 
frequency mixes producing tones at every frequency generated by third order non- 
linearities, in particular at w; — Aw = 2wı — @2. To see this, start with a frequency 
mix m producing the frequency of interest and add the same number / > 0 tom, and 
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Table 9.1 Frequency mixes generated by the first, second and third order nonlinearities of a weakly 
nonlinear system driven by two sinusoidal tones 


order m = (m2, M—1, M1, M2) Om 
k=1 (1, 0, 0, 0) =a = —w, — Aw 
(0, 1, 0, 0) —o| = —0] 
(0, 0, 1, 0) | =o 
(0, 0, 0, 1) w = w + Aw 
k=2 (2, 0, 0, 0) —2a2 = —2w; — 2Aw 
(0, 2, 0, 0) —20, = —201 
(0, 0, 2, 0) 2a, = 20,1 
(0, 0, 0, 2) 2w = 2m, + 2Aw 
(1, 1, 0, 0) @2 — @| = —20; — Aw 
(1, 0, 1, 0) —o7) + a = —Aw 
(1,0,0,1) 0=0 
(0,1,1,0) 0=0 
(0, 1,0, 1) —w] + @2 = Aw 
(0, 0, 1, 1) wi + @2 = 2a; + Aw 
k=3 (3, 0, 0, 0) —3a72 = —301 — 3A@ 
(0, 3, 0, 0) 30) = —301 
(0, 0, 3, 0) 301 = 301 
(0, 0, 0, 3) 3w = 301 + 3A@ 
(2, 1, 0, 0) 2@2 — w1 = —3a@1 — 2Aw 
(2, 0, 1, 0) 202 + @| = —a@| — 2AM 
(2, 0, 0, 1) -a = —a — Aw 
(1, 2, 0, 0) w — 2w = —3@ — Aw 
(0, 2, 1, 0) -—| = —01 
(0, 2, 0, 1) 2a; + @2 = —@, + Aw 
(1, 0, 2, 0) -—a2 + 20; = a — Aw 
(0, 1, 2, 0) @| =| 
(0, 0, 2, 1) 2m, + @2 = 3a, + Aw 
(1, 0, 0, 2) @2 = w; + Aw 
(0, 1, 0, 2) —w] + 2@2 = a + 2Aw 
(0, 0, 1, 2) wi + 2@2 = 3w + 2Aw 
d, 1, 1, 0) -o = —w — Aw 
d, 1,0, 1) -—o, = —0] 
(1,0,1,1) w =o} 
(0,1,1,1) @2 = w] + Aw 
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Fig. 9.7 Hypothetical 

phasor diagram for the 

response at œ; — Aw under 

the assumption that c 
components of order higher ¥5,(1,1,3,0) 
than fifth are negligible 


Cc 
5,(2,0,2,1) 


Cc 
Y3,(1,0,2,0) 


total sum 


M_n for any n ranging from 1 to N (the number of input sinusoidal tones, here 2) 
m =(m_y,...,m_,+1,...,m, +1,...,my). 


Then the order of the new frequency mix m’ is 2/ higher than the one of m and 
the angular frequencies œm and @, associated with the two frequency mixes are 
identical (see (9.27)). 

Using this construction starting from (1, 0, 2, 0), we see that the fifth order mixes 
(2, 0, 2, 1), (1, 1, 3, 0) and their reverses produce tones at w; — Aw 


De (2 -Aw 
¥5,2.0.2,1 (t) = Safa Aj A2H5(— Ja, — Jo, J@1, Joi, Jone 1] 


5 aes 
¥5,1,1,3,0) (É) = ae {Az Ai A} Hs(—Jw@2, —J@1, J@1, J, Joe OTs] . 


The total response of the system at the frequency w; — Aw is therefore a possibly 
infinite sum composed by the above mixes and higher order ones 


e E C 
Y3,1,0,2,0) + ¥5,2,0,2,1) + ¥5,0,1,3,0) He 


This sum can be represented graphically by drawing the phasor of each summand 
as a vector in the complex plane and summing them by vector addition. Figure 9.7 
shows the phasor diagram for the above sum under the assumption that summands 
of order higher than fifth can be neglected. 

Observe that summands of different order depend differently on the amplitude 
of the input signals |A,| and |A2|. For small input amplitudes the third order one 
is usually the dominant. As the amplitude of the input tones grows, higher order 
summands become first significant and then dominant. This means that both the 
magnitude as well as the phase of the output tone does change with the amplitude 
of the input signals. At some level of the input tones there may even be a canceling 
effect where the output tone becomes very small. 

Among the 56 frequency mixes of fifth order there are several of them generating 
tones at new frequencies. In particular the closest in frequency to w; and œz (not 
generated by lower order mixes) are at w — 2Awm and w + 2Aø. Similarly, higher 
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Fig. 9.8 Positive part of a typical magnitude output spectrum of a weakly nonlinear system driven 
by two sinusoidal input tones. The number q above each spectral line indicates the lowest order 
nonlinearity generating the line. The same line is also generated by every nonlinearity of order 
q + 21,1 € N. Only lines generated by fifth or lower order nonlinearities are shown 


odd order frequency mixes introduce tones at new frequencies spaced by Aw from 
the previous ones. Figure 9.8 illustrates a typical spectrum of the output signal. For 
simplicity of representation the figure only shows lines generated by fifth or lower 
order nonlinearities. 
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Chapter 10 A) 
Composition of Weakly Nonlinear Time ciecie; 
Invariant Systems 


10.1 Cascade of Noninteracting Systems 


When building large systems, it’s common to construct them by combining smaller 
subsystems. To gain the ability to investigate such systems, in this section we study 
the fundamental operation of cascading two systems, that is, of connecting the output 
of one system to the input of another one. In our treatment we are going to assume 
that this connection doesn’t change the behavior of the involved systems. This is not 
always the case. Therefore, before applying what follows, we must carefully ponder 
this aspect. 

Consider the cascade of the weakly nonlinear systems G and H as shown in 
Fig. 10.1. Both systems are characterised by their respective impulse responses gg 
and h; that we assume to belong to a convolution algebra Hisy: We are looking 
for an expression to represent 


y = (ho g)[x] := Alg[x]], 


the composition of H after G that we denote by H o G. 
Let’s first consider the system G. It’s output z when driven by the one dimensional 


distribution x, is 
CO 
k 
Z= ) 8k eae ‘ 
k=1 


If instead of representing the input signal by a one dimensional distribution xı, we 
represent it by a sequence x = (0, x1, 0,...) € A with all its components but 


@®,sym 
xı equal to zero and use the product that we defined on sia then we can express 


z in the equivalent form 
[e6] 
A 5 gex xÉ. 
k=1 


© The Author(s) 2024 169 
F. Beffa, Weakly Nonlinear Systems, Understanding Complex Systems, 
https://doi.org/10.1007/978-3-031-40681-2_10 


170 10 Composition of Weakly Nonlinear Time Invariant Systems 


Table 10.1 Lowest order impulse responses of the composite system h o g in terms of the impulse 
responses of h and g 


(hog), =h; *g1 

(ho g)2 = hı * g2 + hy * gh 

(ho g)3 = hı * g3 +2 h * [g1 8 82]sym + h3 * se 
(hog) =h * g4 + ha =» [87] ym t22 * [81 8 83lsym 


+3 hz * [ge 8 g] + h4 * g? 


sym 


(ho g)s = hi * gs +2h2 = [81 8 g4 ]sym + 2h2 * [82 ® 83 ]sym 
+3hz * Ba 2 g3] + 3h3 * [gi @ 8” | ym 


sym 


+ 4h4 * Bi ® 22] +h; * a” 


sym 


The obtained expression is even more reminiscent of a power series than the original 
one and, more importantly, it is more amenable to generalisation. In fact, if we assume 


that this expression remains valid for arbitrary input signals belonging to Aes is then 


the same expression can be used to describe the output of H in terms of z 


oo 
y=) hz. 
k=1 


We can then define the composition of weakly nonlinear systems by 


[e] oo 
(ho Dixi] = (hog x xP = Do hee 
k=1 k=l 
= hy * (g1 * X1 Hg *xe? +--+) 
+ ho * (g1 x x1 + go xx 4-7 
+ hg * (81 * xi +g ¥xP H 


(10.1) 


with (h o g)ą denoting the kth order impulse response of the overall system and 
consisting of all terms of dimension k. Note that, for every value of k, there are only 
a finite number of them as the lowest tensor power of xı appearing in z” is the nth 
one and thus 

(z) =O for n>k. 


The first five components are listed in Table 10.1 for easy reference. Note, here as 
well, the analogy with power series and their composition [25]. 

The above definition by itself is not complete as the convolution between distri- 
butions of different dimensions is not defined. To complete the definition we have 
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Fig. 10.1 Cascade of the z 
system G with H x — ¢ ~> H | y 


Table 10.2 Convolutions between impulse responses of different order appearing in the composi- 
tion of weakly nonlinear systems and their definition. They are grouped by the resulting order, from 
second to fifth. To simplify the notation the symmetrization operation is not explicitly shown 


Convolution Definition 

hi * go [hi (t1) @ (T2 — T1)] * g2(T1, T2) 

hy * 83 [hi (T1) @ (T2 — T1, T3 — T1)] * 93(T1, T2, T3) 

hg * (81 8 82) [h2 (11, T2) @ (T3 — T2)] * [81 (t1) 8 82(12, T3)] 

hı * 84 [hi (t1) (T2 — T1, T3 — T1, T4 — T1)] * 84(T1, T2, T3, T4) 


h2 * (82 8 82) [h2 (T1, T3) @ 6(t2 — T1, T4 — T3)] * [82 (T1, T2)  82(T3, T4)] 

hg * (81 8 83) [h2 (t1, T2) (T3 — T2, T4 — T2)] * [81 (11) Q 83(12, Ts, T4)] 

h3 * ge 882) | [h3 (t1, 12, 3) ô (T4 — T3)] * [81 (T1) Q g1 (12) 8 82 (T3, T4)] 

hy * g5 [hi (t1) @ 6(t2 — T1, T3 — T1, T4 — T1, T5 — T1)] * 8g4(T1, T2, T3, T4, TS) 

h2 * (g1 Q ga) [h2 (T1, T2) @ (T3 — T2, T4 — T2, T5 — T2)] * [81 (T1) Q 84(12, T3, T4, T5)] 
hg * (82 8 83) [g2(T1, T3) @ (T2 — T1, T4 — T3, T5 — T3)] * [82(T1, T2) @ 83 (T3, T4, T5)] 
h3 » (88° @ 83) | [h3(T1, T2, T3) @ 5(t4 — T3, T5 — T3)]* [81 (T1) ® 81 (T2) ® 83(T3, T4, T5)] 
h3 * (81 ® gs”) [h3(T1, T2, T4) @ 6(t3 — T2, T5 — T4)] * [g1 (T1) 8 g2(12, T3) @ 92(t4, T5)] 
h4 * eS @ g2) | [h4(t1, T2, T3, T4) @ ô(T5 — T4)] * [81 (11) 8 81 (12) 8 81 (13) ® 8g2(T4, T5)] 


thus to give a meaning to all undefined convolutions appearing in the expression for 
(h o 8). 

Let’s consider the convolutions appearing in (h o g). The undefined ones are 
the ones involving h; with | < k. The first thing to note is that, for every /, all 
of them are convolution products between h; and a distribution that is the tensor 
product of / distributions. In addition, by definition, the sum of the dimensions of 
these / distributions must be k. The convolution products that we have to define have 
therefore all the form 


hı * Ew @---® evi" |, , œN (10.2) 


with 
k-I+1 k-I+1 


D3 ia; =k, 5 a; =l. (10.3) 


i=1 i=1 


The simplest case is the one for l = 1 


hy * 8g 
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which represents a nonlinearity of order k, gg, followed by a linear system h,. To 
find a suitable general definition for this convolution we let us guide by regular 
distributions belonging to L4, evaluated on the diagonal. Setting k = 2 for simplicity 


we have 


eva(z2)(t) = eva (g2 * x2") (t) 


Co œ 
= | f eOr = admit = adad. 
0 0 


Using this expression as the input of hı we obtain 


HOE I WiGQeoesara 
0 
= f ma f f e201) 
0 0 0 


“x(t — Ay — Ti )x (t — à2 — T1 )dàidà2dT1 


œ œ œ 


= | | | hads0i- na- nan 
000 


- xı(t — à1)xı (t — Az) dàidà2. 


Note that the innermost integral in the last expression is a convolution integral 
between h; and g2. It can be generalised to arbitrary distributions by building the ten- 
sor product of h; (t1) with ê (t2 — T1), the Dirac delta distribution in t parameterised 
(shifted) by T1 


([A1(t1) 8 (T2 — T1)] * 82(T1, T2), $ (T1, T2)) 
= (hi (t1) @ (T2 — T1) 8 B21, Az), Ø (T1 + AL, T2 + Ad) 
= (hi (T1) 8 821, à2), (8 (T2 — T1), (Ti + Ad, T2 + A2))) 
= (hı (t1) 8 8241, A2), (Ti + Ar, Ti + A2)) . 
The above derivation generalises without any difficulty to the convolution between 
hı and the impulse response of order k of G. Taking into account that impulse 


responses have to be symmetric, we thus define the convolution between h; and gg 
by 


(hy * 8k) (T1, -> Tk) = [h1 (t1) 8 (T2 — T1, «+, Tk — TH) gym * &k(T1, «++» Tk) - 
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In other words, to convolve h; with a distribution of dimension k we promote h; to 
a distribution of k dimensions by building the indicated tensor product and use the 
standard definition of convolution. 

The Laplace transformed of hı * gą has a very simple representation and leads to 
an easy interpretation. With 


(h(t) Q b(t —T,...,T% — T1), E UE) 
= (hi), (8T —T,..-,% — T1), a) 


= (hi (t1), een 


we find 
Lihi * Bh (Si, -3 Se) = Hi (s1 + +++ Sk) Ge(S1, -a Sk) - 


Therefore, if the input signal x; consists of N tones, the nonlinear system component 
gx generates new tones at frequencies that are linear combinations of k of the input 
frequencies at a time (see (9.25)). The linear system hı following it simply filters 
these newly generated tones as prescribed by its transfer function Hj, in accordance 
with expectation. 

Consider next the next simplest undefined convolution 


hz * [81 8 82lsym - 


As for the previous case, we look for a way to promote hp to a distribution of 
dimension k = 3 so that we can use the standard definition of convolution. We do so 
by working with multi-tone input signals as this leads to easier interpretations. 

Let g1 ® go be driven by 3 unit tones 


x1(t) = e721! + el 2! + ef 03! 3 


then its output is 


[g1 ® 82lsym * x8 


3 3 3 
=>) > yy SG etonja, jor ere 
mi Gi (Jan, Je? Ga(J Om, JOn, jE Ong 2th) 


with Gj (s1)G2(s2, 53) the Laplace transform of gı ® go. This expression suggests 
that gı ® g2 can be interpreted as the parallel combination of a linear system and 
a second order one. For each term of the sum, the tone at w,, passes through the 
linear system gı while the other two pass through g2. The output evg(g; ® g2)(t) can 
thus be considered as consisting of a sum of pairs of tones, one at w,, and the other 
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at Wn, + @,,. This sum of tones couples constitute the input of h2 which processes 
them and, for each tone couple generates the signal 


Hy(JOn,, JOny + JOn;)G1 (J@n, )G2(JOn,, J@n, el Om tt), 


Given these considerations, we define the convolution between hz and[g; ® g2] 
by 


sym 


h2 * [81 ® 82]sym = [C22 (t1, T2) B ê (T3 — T2)) * (81 (T1) ® 82(T2, 73) ) gym - 
Its Laplace transform is 
Lh * [81 ® 82lsym} (1, $2, 93) = [H2(s1, $2 + 93)G1 (51) G2(52, $3) sym - 


The above considerations can be extended to the general case (10.2). The tensor 
product of / distributions 


k—I+1 
Gay @a-141 _ 
81 Q OR 44 ; ) a; =1 
i=l 


can be thought of as a set of / parallel subsystems of order lower than k. The constraints 
(10.3) make sure that with k input tones, its output can be made to consists of / tones 
at linear combinations of the original input frequencies. These can then be passed as 
input to h;. 

The intended meaning of the generalised convolution expressed by (10.2) can thus 
be captured by promoting h; to a k dimensional distribution obtained by building 
the tensor product of h; and k — l appropriately shifted ô distributions constructed 
as follows. 


e The first independent variable of each of the / distributions 
Em: (T1; «+ <, Tm) D +++ @ Bm; (Trt, «++ Trtm;) D +++ @ Em (Thm 41s +++ > Tk) 
form the list of independent variables of h; 
hi (Tisana Taly. tees CR GA) 


e For each additional variable of g,,,, mj; > 1, we tensor-multiply A; by a Dirac 
distribution in this same variable, shifted by the first one 


8 (Tr42 — Tn+1) @ +++ @S(Tr4m; — Tn+1) - 


e The resulting k dimensional distribution has finally to be symmetrized. 
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Table 10.3 Convolutions between impulse responses of different order appearing in the composi- 
tion of weakly nonlinear systems and their Laplace transforms. They are grouped by the resulting 
order, from second to fifth. To simplify the notation the symmetrization operation is not explicitly 
shown 


Convolution Laplace transform 

hy * 82 Hı (sı + 52)G2(s1, 2) 

hy * g3 Ay (sı + 52 + 53)G3(S1, $2, $3) 

h2 * (g1 ® g2) [H2(s1, 82 + 53)]Gi(s1)Ga(s2, 53) 

hy * 84 Hı (sı + 52 + 53 + $4)Ga(S1, $2, 53, 54) 

h2 * (82 8 82) [H2 (s1 + 82, 53 + s4)]G2(81, 82)G2(53, 84) 

h2 * (g1 ® 83) [H2(s1, 82 + 3 + $4)]G1(s1)G3(s2, 53, 84) 

h3 * (8 Q go) [H3 (s1, 52, 53 + $4)]G1(s1)Gi (s2)G2(83, 84) 

hy * g5 Ay (s1 + 52 + 83 +84 + 85)Gq(s1, $2, 53, S4, 85) 

h2 * (g1 ® ga) [H2(s1, 82 + 3 + 54 + 85)]G1(51)Ga(s2, 53, 84, 55) 
h2 * (g2 8 g3) [H2(s1 + 82, 853 + s4 + s5)]G2 (s1, 82)G3(s3, S4, 95) 
h3 * (82° 8 83) [H3 (s1, 82, 53 + s4 + 85)]Gi(s1)G1 (s2)G3(53, 54, 55) 
h3 * (81 9 88°) [H3 (s1, 52 + 53, 54 + 85)]G1(51)G2(52, 53)G2(S4, 55) 
h4 * Er ® 82) [H4(s1, 82, 53, $4 + 85)]G1 (s1)G1 (s2)Gi (53) Ga (84, 55) 


A few convolution examples are given in Table 10.2. The Laplace transformed of 
these examples are tabulated in Table 10.3. With this definition we have completed 
the description of how to compose weakly nonlinear systems. 


Example 10.1: Third-Order Nonlinearity 


The third order nonlinearity of H o G is generated in three distinct ways: First, by 
the nonlinearity of third order of H applied to the output of the linear part of G 


H3 (s1, $2, 83)G 1 (81) G1 (52) Gj (82) X 1 (51) X 1 (82) X1 (53) , 
second, by the nonlinearity of third order of G passing through the linear part of H 
Ay (s1 + s2 + 53)G3(S1, $2, 52) X1 (81) X1 (52) X1 (53) 


and third, by the second order nonlinearity of H applied to the output of first and 
second order of G 


2 [H2(s1, 52 + 83)G1 (51) G2(52, 53) sym - 


These mechanisms are represented graphically in Fig. 10.2. In particular one 
should note that, even if neither G nor H shows nonlinearities of third order, the 
combined system H o G in general still has an impulse response of third order dif- 
ferent from zero. 
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Gi(s1) 
® 
m~~ Gi(s2) m A3(s1, $2, $3) 
® 
Gi(s3) 
xe Ys 
— 4 2G1(s1) t-— 
{| Q c H>(s1, s2 + 83) m 
G2(s2, 53) 
— G3(s1, 82,53) ~>] Ay (s1 + 52 + s3) 


Fig. 10.2 Graphical representation of the third order nonlinearity generated by the composition 
H o G. Each path has to be understood as symmetrised 


Example 10.2: Memory-Less Systems 


Consider the convolution(10.2) with A; a Dirac distribution of dimension / < k 
§2! | &aı S sd 
xg 8 @& 744 Sm 


By definition, the lower dimensional distribution 68’ is promoted to a distribution 
of dimension k by building the tensor product with shifted Dirac distributions as 
explained. For simplicity, we denote the promoted distribution by hg. Application of 
the convolution to a test function @ € D(R*) is defined by 


(into alsa eser]  .o@+ ») 


sy. 


= ([se" Q-Q seit | , (h(t), (Tt + i) 


sym 
with t, A € R*. The inner distribution is easily evaluated 
(h(t), p(t +.A)) = GA) 
and from this we conclude that for any l < k 
5° «| gh @ + @ geet ‘Tos =|" oo pty i= 


With this result we see that the response of amemoryless weakly-nonlinear system 
can be written in the following equivalent forms 
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[o0] 

y= oe =) oaa, Lek, (10.4) 
k=1 


In general we will use /, = k so that, if the input signal is a one dimensional distri- 
bution xı, we do not need to use the extended definition of convolution. 


Our definition of convolution between distributions of different dimensions and 
our definition of the one dimensional differential operator operating on higher dimen- 
sional distributions (9.14) are compatible. In fact the former is a generalization of 
the latter. Consider the differential operator acting on the k dimensional Dirac dis- 
tribution 5®*. Application to a test function o € D(R*) results in 


k k 
(D8, ¢) = ps Da, s) = {3 $n) 
j=l j=l 
k 
=—) Dj¢(0,...,0). 
j=l 


If we now consider ¢ as a function of the variable tı only and D,, the total differential 
operator, then we can write 


k 
-$ D0, ...,0) = —(5(t1), Dao (Ti, T1)) 
j=l 


= (Da8 (t1), 6(t1,..., 71) 
= ((D8) * 5%, g) 


which shows that our definition of the differential operator acting on a higher dimen- 


sional distribution is equal to the convolution of the one dimensional distribution Dé 
promoted by our definition of convolution to a k dimensional distribution 


D82 = (D8) x 82 . (10.5) 
This is also apparent from the Laplace transformed that in both cases are equal to 
S1 He ESk. 
The differential operator and the extended definition of convolution do satisfy 
(3.15). We show this by way of an example. Consider the convolution between h2 
and g; ® go. Suppose further that / is the derivative in the sense of (9.14) of another 


distribution w2 
ho = Dwz x 
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Applying the convolution product to a test function ¢ € D(R*) and using the 
extended definition of convolution we obtain 


(h2 * (81 8 82), $) = (Dun * (81 @ 82), $) 
= ({[Dw2(11, T2)] ® 6(t3 — T2)} @ [91 A1) @ 822, A3)], 
(Tt +A, T2 + Az, T3 + A3)) 
= ([Dw2(t1, T2)] 8 [81 (1)  82(å2, A3)], 
Q (Ti + At, T2 + à2, T2 +A3)) . 


Further, using the definition of differentiation and noting that 
Da (Ti + Ài, T2 +A2, T2 + à3) = (Da, + Dra )O (Ti + At, T2 + A2, T2 +3) 
we obtain 


— ([w2(t1, T2)] 8 [81 A1) 8 8242, A3)I, 
(Dr, + Da) (Ti + At, T2 + Aa, T2 +A3)) 
= — ([w2(t1, T)] @ [g1A1) 8 822, A3)], 
(Dj, + Dy, + DaO (Ti + At, T2 + A2, T2 + A3)) 
= (w2(T1, T2) @ D[gii) 8 822, å3)], 
P(t + Aj, T2 + Az, T2 + A3)) 


or, summarising 
(Dw2) * (g1 ® 82) = w2 * D(gi ® 82). (10.6) 


10.2 Feedback 


A powerful technique used in the design of all sorts of systems is feedback. In control 
systems design, this technique is used to stabilise and adjust the dynamics of a system 
to achieve a desired behaviour. It’s also used to reduce the sensitivity of systems to 
poorly controlled parameters. Here we are interested in describing the nonlinearities 
of a system making use of feedback based on the ones of its constituting subsystems. 

Consider the system shown in Fig. 10.3 composed by a forward subsystem G and 
a feedback subsystem H. The input of G is the difference between the input signal 
x and the signal z, a signal obtained by sensing the output y and suitably processed 
by H. The system is described by the following equations 
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Fig. 10.3 Weakly nonlinear + 
system with feedback x G y 


e=x —-Z 
z = (h o g)[e] 
y = gle]. 


Our objective is to obtain the impulse responses of the system based on the ones of 
G and H. We denote the overall system by W and its impulse response of order k 
by Wk. 

We start by computing the linear impulse response. The composition of linear sys- 
tems is obtained by convolving their first order impulse responses. We can therefore 
write the equation 

ey = ô — Z1 = ô — hı * g1 * €] 


and, solving for e;, we obtain 
ei = (6 +h, * g)! . 
With e; the calculation of the linear impulse response is immediate 
w = g1 * e1 = ( + hi * g1)* | * g1. 
Its Laplace transform is a classical result of linear system theory 


G(s) 


MOS TFMOGO 


If in the frequency range of interest the magnitude of the linear loop gain is large 
|Hı(JÆæ)Gı(Jæ)| > 1 then the linear response of the system is almost exclusively 
determined by the feedback network 


Wi(j@) © 


Hi (jo) 
For completeness, we give the Laplace transform of e; as well 


1 


Fy (sı) = 1+ AGG) E 
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With it the first order transfer function can be written as 
Wi (sı) = Gi (s))E\ (51). 


Next we compute the impulse response of second order W2. Using the generalised 
response of weakly nonlinear systems (10.1), the second order component of the 
output signal y and of the feedback signal z are given by 


2 
y2 = g2 * e? +H8gi*e 
2 
z2 = hz * yP + hı * yo. 
Note that, since we used a Dirac impulse as input, the output components yz and yı 
correspond to the impulse responses wz and w; respectively. By substituting the first 
equation into the second, using the previous result for w; and taking into account the 
fact that the input signal is a one dimensional distribution, we obtain an equation in 
ez 
2 2 
z2 = —ez = hy * (g1 * e1)? + hy * (g2 x Cf + gi * e2). 


whose solution is 
e = = (687 + hı * gi)* x (ha x we? + hy * 29 * ef) š 
With e2 and the previous results for e; and w; the second order impulse response is 
thus given by 
w = g2 * e8 + gi * e. 
Its Laplace transform is 
W2 (s1, 82) = G2(s1, $2) Ei (81) E1 (s2) + Gils + 52) E2(s1, 52) 
with 


E2(s1, 52) = 
(81, 82) Wy (81) Wi (82) + Hy (s1 + 52)G2(S1, 52) Ey (81) Ey (s2) 
1+ Hı (sı + 52)Gy(s; + 52) ` 


Combining these expressions and using previous results, we can write Wz in the 
following form 


W2(s1, s2) =| Eis + 52)G2(s1, 92) 


— Wi(s1 + 82) Ha(61, 82)Gi(61)Gi(s2)} EGE) (10.7) 
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Fig. 10.4 Signal flow graph xG k>2 
of a weakly nonlinear system kia 
with feedback 


œx Ay, k > 2 


which is easily interpretable with the help of the signal flow graph (SFG) shown in 
Fig. 10.4 (see Appendix A). 

The first term is composed by the transmission of the input signal—we think 
of it as composed by k tones — through the linear system to node E, the input 
of the nonlinear subsystem G. This part of the signal flow is represented by the 
factor E (s1) E1 (s2). The second order nonlinearity of G then generates a new tone 
as determined by G2(s1, 52). This newly generated tone is represented in the SFG 
by a source node because it is different from the input ones. The propagation of the 
new tone to the output of the system is accounted for by the last factor, E1 (s1 + s2). 

The second summand in (10.7) has a similar interpretation. The input signal first 
propagates through the linear system to the input of the other nonlinear subsys- 
tem H. This part of the signal flow is represented by G; (s1) E1 (s1)G1 (52) £1 (52) = 
Wi (s,)W,(s2). The second order nonlinearity of H then generates a new tone as 
determined and accounted for by the H>(s,, s2) factor. Finally, the new tone propa- 
gates to the output of the system, contributing the last factor, — W; (s1 + s2). 

We now proceed with the calculation of the third order impulse response of the 
system. The procedure is similar to the one used for the computation of the second 
order one. From 


y3 = g3 * e8? + 2g * [e1 Q erlym + 81 * 63 
z3 = h; * yp? + 2h2 * [Y1 Q Y2lsym + hı * y3 


and the previous results we obtain an equation for e3 


z3 = —€3 = hzx (g1 * e1)?’ 
+ 2h, * [(81 * e1) Q (82 * eP? + g1 * e2)] m 
+ hy * (g3 * e®? + 2g * [e] ® ezlsym + 81 * e3) 


whose solution is 
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e3 = (8% + hy * g1)" x {hs * (g1 * e1)%3 
+ 2h * [(81 * e1) Q (82 * eF? + g1 * Cr) Jom 
+ hy * (g3 * e8? + 2g * [e1 Q erlyn). 


The third order impulse response is obtained by inserting this expression for e3 and 
the previous ones for e; and ez into 


w3 = g3 * e8? + 2g x [e] 8 ezlsym + 81 * 63. 


As we find the expressions more easily interpretable, we perform this calculation in 
the Laplace domain. The Laplace transform of the last expressions for w3 and e3 are 


W3 (s1, 82, 3) =G3(S1, 82, 83) E1 (s1) Ey (82) E1 ($3) 
+ 2 [G2(s1, 82 + 53) E1 (51) E2(s2, 53) sym 
+ Gi(s1 + s2 + 53) F3(81, S2, 53) 


and 


—1 
1+ Hi (sı + 52 + s3)G1 (s1 + s2 + 83) 


[n (s1, 52, 83) W1 (s1) Wy (s2) Wj (53) 


E3 (s1, S2, 53) = 


+ 2| Ha(si.s2 + s3)W1 (sD [G2(52 83) E1 (62) E183) 
+ Gi (s2 + 53) E2(s2, s3)]| 


sym 


+ Hi(sı + s2 + s3)[Ga(s1, $2, 83) Ey (81) E1 (s2) Ey (83) 


+ 2[Go(s1, s2 + 83) E1 (51) E2(82, som] 


respectively. Combining these and previous results we can express W3 as follows 


W3(s1, $2, 83) = Ey (81 + s2 + 83)G3(S1, 82, 83) Fy (81) E1 (s2) Ey (83) 
—W, (s1 + s2 + $3) H3(51, 52, 53) W1 (81) W1 (52) Wj (s3) 
+2 W1 (81 + s2 + s3) A2 (81, s2 + 53)Wi (51) 


[w (s2 + 83) H2 (s2, 83) Wi (82) Wi (s3) 
-E1 (s2 + 83)Go(82, 53) E1 (82)E1 (s3) | 


—2E1 (s1 + s2 + $3)G2(81, 82 + 853)E1 (s1) 


sym 
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[A (82 + 83) E1(s2 + 53)G2(82, 53) E1 (82) £1 (83) 


+E (62 +53) Ha(s2,83)Wi(s2)Wilss)] 10.8) 


sym 


While this expression is rather long, it can be readily interpreted with the help of 
the SFG of Fig. 10.4. The first term is composed by the factor Fj (s1) E1(s2) Fi (s3) 
representing the input signal propagating through the linear part of the system to the 
input of G. The third order nonlinearity of G then generates a new tone as witnessed 
by G3(s1, s2, 53). Finally, the newly generated tone propagates through the linear 
part of the system to the output, E1 (s1 + s2 + 53). 

The second term has a similar structure and represents the contribution to the third 
order nonlinearity of W by the third order nonlinearity of H. 

The next summand represents the mixing of the second order nonlinear component 
of H with the input signal in the second order nonlinearity of H (again). Specifically, 
thinking of the input signal as composed by three tones, the factors W1 (s1), Wi (s2) 
and W; (s3) represent the input tones propagating through the linear part of the system 
to the input of H. There, the second and third tones pass through the second order 
nonlinearity of H generating a new second order tone, H>(s2, 53). This second order 
tone then propagates through the linear part of the system to the input of H, — W1 (s2 + 
53). There the second order tone and the first input tone pass through the second 
order distortion of H together and generate a new third order tone as witnessed by 
2 (51, s2 + s3). Finally, the third order tone propagates through the linear part of 
the system to the output, — W1 (s1 + s2 +53). 

The remaining summands have all a similar structure and interpretation as the one 
just described. They describe the first input tone mixing with a second order tone. 
The difference between them lies in which subsystem generates the second order 
tone and which one mixes the first tone with the second order one. 

Higher order impulse responses and nonlinear transfer functions of W can be 
obtained in a similar way. While the expressions become long, they can easily be 
computed with the help of computer algebra systems (CAS) computer programs and, 
referring to the SFG in Fig. 10.4, can be interpreted without difficulty. 

From the first three nonlinear transfer functions of the feedback based system W 
we can draw the following conclusions. 


The nonlinear transfer functions of the constituting subsystems play the role of 
controlled sources. 

The linear part of the subsystems plays a pivotal role. It describes the propagation 
around the system of all input and generated signals. 

The system W can have an impulse response of order k different from zero even 
if the impulse responses of order k of both subsystems G and H are zero. In 
particular, we saw how a third order nonlinearity can be generated by various 
combinations of the nonlinearities of second order of G and H. 

The nonlinear terms generated exclusively by the forward subsystem G can be 
suppressed by making the magnitude of the loop gain |H; (s)Gı(s)| large (in a 
suitable portion of the spectrum). This is so because all such terms in the nonlinear 
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transfer function of order k are proportional to 
Ey (s1) +++ Ey (sk) Ey (81 + +++ + 5x) 


and, as the loop gain is made large, E; becomes small. 

The nonlinear terms generated exclusively by the feedback subsystem H are not 
suppressed by making the magnitude of the loop gain | H(s)G,(s)| large. That’s 
because none of these terms are proportional to the linear component of the error 
signal FE). Instead, they are all proportional to 


Wi (51) --> Wills.) Wi (s1 +--+ + sk) 


which doesn’t necessarily become small as the loop gain is made large. 
Nonlinear terms generated by combinations of nonlinearities of G as well as of 
H include factors in E; and therefore do experience some level of suppression at 
large loop gains. 


Example 10.3: Linear Feedback 


As a special case we consider a system with linear feedback. This means that all 
transfer functions of H are zero, except for Hı. In this case the second and third 
order nonlinear transfer functions of the system are 


Go(51, 52) FE (51) E1 (s2) 
1+ Ay(s; + 52)Gi(s; + s2) 
= Ey (sı + 82)Go2(s1, 82) E1 (51) Ei (82) 


W2(s1, 52) = 


and 


W3(s1, 52, 83) = Ei (s1 + $2 +53){Ga(o, $2, $3) 


— 2[Ga(s1, s2 + 53) Hi (82 + 53) E (s2 + 83) G2(s2, 53) asm 
< Ey (81) Ey (82) £1 (s3) 


respectively. Both of them are proportional to 


Ey (81) +++ Ey (sx) Ei (s1 + +++ + sk) 


and can therefore be suppressed by making the loop gain large. 
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Fig. 10.5 Signal flow graph 1 E G 1 
for the system of X(sı) -° > a, + = . Y 
Example 10.4 


Example 10.4 


We revisit Example 9.5 again. Here however we replace the initial condition yod by 
a generic input signal x so that the system equation becomes 


(D6 +45) * y =x + cy’. 
Using (10.1) we can rewrite the equation in the following form 
y = (D8 + aô)! * (x + cy’) 
which can be interpreted as describing a linear system with nonlinear feedback. The 
problem can therefore be recast as the problem of finding the nonlinear transfer 


functions of a system W constituted by the forward subsystem G with linear transfer 
function 


Gı(sı) = PE 
1 


and a feedback subsystem H described by the second order nonlinear transfer func- 
tion 
Hz(s1, s2) = —c 


as shown in Fig. 10.5. Note that we have assumed negative feedback for consistency 
with our general treatment. This last expression is obtained by specialising the general 
expression cy? to an input signal having only a one dimensional component yı 


2 2 2 2 
cyi = cy? = cô? x ye, 


The obtained expression clearly describes a system whose only impulse response 
differing from zero is the second order one hy = cô8? (see also Example 10.2). 

In this formulation of the problem the solution is found by inserting the above 
expressions for the transfer functions of the subsystems into Eqs. (10.7) and (10.8). 
The obtained expressions obviously agree with the ones obtained in Example 9.5 by 
calculation from the convolution equation. 
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10.3 Linearisation 


Many systems are designed based on the theory of linear systems and the deviation 
from linear behavior in practical implementations is undesired. For this reason in 
practical implementations one often tries to minimise the responses of order higher 
than one. In this section we investigate the possibility of suppressing higher order 
responses by preceding the system in question H with another system G or by 
following it with a system K. 

We call a system K designed to suppress all nonlinear transfer functions of K o H 
up to order k a post-lineariser of order k and a system K suppressing all responses 
of K o H of order higher than one a post-lineariser. Similarly, we call a system G 
designed to suppress all nonlinear transfer functions of H o G up to order k a pre- 
lineariser of order k and a system G suppressing all responses of H o G of order 
higher than one a pre-lineariser or pre-distorter. 

We first investigate post-linearisers. The first requirement is that the system K 
should not change the linear response of H. This is only the case if the linear impulse 
response of K is a Dirac impulse 

k=6. 


Next, we look for a condition to suppress the response of second order. Referring to 
Table 10.2 we see that the second order response of K o H disappears if 


(k o h)a = ky * ha + ko * h? =0. 
Therefore, if hı has an inverse, we can make (k o h) disappear by choosing 
ky = —h, * (h8! . (10.9) 
In the Laplace domain this is 


Hz(s1, 82) 


Kals s) = -Ae 


(10.10) 


Next we look for a condition to suppress on top of (k o h)z also (k o h)3. Referring 
again to Table 10.2 we find the following condition 


(k o h)3 = kı * h3 + 2ky * [hy ® halsym + ks * h? = 0. 


As for the second order, this equation can be solved for k3 only if hı has an inverse, 
in which case, using the previously obtained values for kı and k2, we find 


k3 = (—h3 + 2h * (hP)! x [Ay ® halsym) * (AP! (10.11) 


with Laplace transform 
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Ay (81 ,52+83) 
—H3(51, 82, 53) +2 | Ba H (s7, sda 


K3 (s1, s2, 53) = (10.12) 


Hı (s1) Hı (s2) Hı (s3) 


This procedure can be extended to find the transfer functions of K up to order 
j such that they cancel the nonlinear responses of K o H up to the jth order. The 
condition for the existence of k; is always the same: the existence of the inverse 
of hı. This is so because in each equation (k o h); = 0, kj appears convolved with 
ne . If we let j tend to infinity we obtain a post-lineariser suppressing all nonlinear 
responses of H. 

The impulse responses of a pre-lineariser G can be obtained following a similar 
procedure. To preserve the response of H, its linear response must be a Dirac impulse 
as for a post-lineariser 

gi =ô. 


The second order response of H o G disappears if 
g2 = ht! x hz (10.13) 


or, expressed in the Laplace domain, if 


Hz(s1, 82) 

G(s}, s2) = ————__ . 10.14 
2(S1, 52) HG Es ( ) 

The third order response of H o G disappears if 

— px-l *—1 
g3 = hy! x (—h3 + 2h2 *[5@ (hy | * ha) | m) (10.15) 
whose Laplace transform is 
— 3 (s1, 52, 53) +2 [ Hs, 52 + $3) -N 

G3(S1, 52,53) = (10.16) 


Hı (sı + S2 + 53) 


and so on. Again, the prerequisite for the existence of these solutions is the existence 
of the inverse of h;. Note also that in general the transfer functions of a pre-lineariser 
are different from the ones of a post-lineariser. 

In summary, we can state that a weakly nonlinear system can be linearised with a 
pre- or a post-lineariser only if its linear transfer function has a stable inverse in the 
convolution algebra of interest. In the convolution algebra of right sided distributions 
this means the existence of a causal and stable inverse. 

A generic linear system may not have an inverse. For example, if the linear impulse 
response hı is a right-sided, indefinitely differentiable function then hı x w is an 
indefinitely differentiable function independently from the choice of w. This means 
that hı x w = ô has no solution and hence h; has no inverse. 
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A class of systems of special interest to us is the class of causal systems whose 
transfer functions are rational functions 


For this class of systems H; (s) is stable and has a causal stable inverse if all poles 
and zeros of H; (s) are in the left-half of the complex plane. 


Example 10.5: Memory-less System Linearisation 


In this example we consider a third order memory-less system H with impulse 
responses 
hi=aô hyn =0 hg = —a,d®. 


We would like to find a pre-lineariser G suppressing the responses of third order. 
The linear impulse response of the system has an inverse 


1 
ajo* —ô = ô. 
a 


Therefore it can be linearised using the results of this section. As hz = 0, the second- 
order impulse response of the pre-lineariser must also vanish 


g2=0. 


The third order impulse response of the pre-lineariser is obtained by applying (10.15) 
and we find 


1 a 
83 = — 8 x a38? = 2883, 
dı ai 


Note that while the pre-lineariser suppresses responses of third order, it does 
introduce responses of higher order 
a3 


hog = aiô * (8 + 28%) — m8 x (6 + 3523)3 
dı ai 


2 3 4 
a a a 
3-2885 — 3-2607 — 35%. 
a, ai aj 


= aiô 


It’s easy to see that to suppress the nonlinear responses up to order k the pre-lineariser 
must be of order k. To suppress them all a full pre-lineariser is needed. 
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10.4 System Manipulations 


In this section we highlight some properties of weakly nonlinear systems that allow 
us to manipulate weakly nonlinear system composed by sub-systems in such a way as 
to obtain different interconnections of the sub-systems without changing the behavior 
of the overall system. 

The first property that we discuss is the associativity of addition which comes 
from the fact that Dayi is a vector space. Thus if f, g and h are three weakly 
nonlinear systems driven by the same input signal x, the ways in which the outputs 
are summed is irrelevant 


(f[x] + glx) + Alx] = fix] + (8x) + AL) = fix] + glx) + hix]. 
The same is true for the product of the output signals 


(FLx] - glx) - ALx] = fix]: (gb) - Ale) = fix]; gle) - hix]. 


/ 


@,sym 1S defined in terms of 


This is the case because the product that we defined on D 
the tensor product and the latter is associative. 
A second important property is commutativity. Addition is always commutative, 


therefore the order in which the signal appears as input to adders is irrelevant 
Fix] + glx] = glx] + fix]. 


While the tensor product is not commutative the symmetrised tensor product is and 
with it the product in Dg sym 


fix]: glx] = glx]; fix]. 


Thus the order in which the signals appearing as input to multipliers is irrelevant 
as well. In fact, because it’s cumbersome to draw symmetrised block diagrams, we 
will generally draw unsymmetrised block diagrams and, if not stated explicitly, imply 
symmetrisation. 

A further equivalence of block diagrams comes from the distributivity of the 
product over addition 


(Fix] + glx): hlx] = fix]; Ale] + gix]: hix], 
Fix]: (glx) + Ax) = fix] glx] + fix] hix]. 


This property originates from the multi-linearity of the tensor product. A block 
diagram representation of the first equality is shown in Fig. 10.6. 
Another equivalence is given by the equation 


(go Pix] + h o Pix] = (8 + h) o flix]. 
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To prove the validity of this equation we prove its validity for terms of each order indi- 
vidually. To simplify the expressions, let’s denote the sum of all /th tensor products 
resulting in a distribution of order k by 


oe ee Nei 


la|=l 
|xa|=k 


with œ a multi-indexe in N* and « = (1, 2,..., k). With this notation the kth order 
impulse responses of the summands on the left-hand side can be written as 


k k 
Gofp=) eas”. dhofpa > hay”. 
{=l l=1 


The two can be combined using the distributivity of convolution (3.13) to obtain 


k 
Ye + hj) * o 


l=1 


which is the kth order impulse response of the expression on the right-hand side. 
The last useful property in manipulating block diagrams is the right distributivity 
of composition 


(go Dix]: (ho f)[x] = (8: h) o flx]. 


We prove again this equality by proving its validity for terms of each order individ- 
ually. The impulse response of order k on the left-hand side is 


i j 
> > gi * Jo a hm * e ; 


i+j=k l=1 m=1 


Hence, dropping symmetrisation operators for simplicity of notation 


7s 7 
x ——| g y x —ẹ¢—>| h y 
= h = g 


Fig. 10.6 Distributivity of WNTI systems 
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ee m z m g 
x| y x gi y 
— f | h — h 


Fig. 10.7 Right distributivity of composition of WNTI systems. The empty circle represents either 
a sum or a product 


D D AG in Y= DT DY Gr @ hind O @ FF) 


i+j=k l+m<k i+j=k l+m<k 


k 
> YS (81 @ hm) * FO 


s=1 l+m=s 


which corresponds to the kth order impulse response of the right-hand side of the 
equation. A block diagram representation of the property is shown in Fig. 10.7. 


10.5 Structure 


A review of our development of the theory of weakly nonlinear systems up to this 
point reveals that weakly nonlinear systems arise out of stable linear systems and 
multipliers. In particular, multipliers are the only mean by which we can combine 
linear systems to produce systems of higher order.! In this section we investigate the 
overall structure of systems constructed this way. 

Let’s start by considering the most generic impulse response of second-order that 
can be constructed out of a single multiplier and linear systems h4, hg and hc 


h2(t1, T2) = [hc * (ha 8 hg)]sym- 


The block diagram of a system whose only impulse response is hz is shown in 
Fig. 10.8a. We call a system whose only impulse response is h; a monomial system 
of ith order. 

In Sect. 3.3 we showed that every distribution can be approximated to arbitrary 
accuracy by a set of weighted Dirac impulses. We can thus approximate the linear 
system hc by 


N 
hce(t) © end — An), Cn €C, Aq € [0, 00). 


n=0 


l In our formalism multiplication is represented by the tensor product. It is only at the end, when 
the output signal of interest is “evaluated on the diagoman” with the operator evq() that the tensor 
product collapses to a multiplication. 
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= ha 
x — he >y 
= hg 
b = ha 
m 6(t — Ao) 
> hpg 
x — Q— y 
= ha 
= O(T An) 
= hp 


Fig. 10.8 a Block diagram of the most generic monomial system of second-order constructed with 
a single multiplier and linear systems h 4, hg and hc. b Approximation of the system in Fig. 10.8 a 


where we assume the system to be causal. Using this approximation in h2 we obtain 


N 


ha(t1, T2) © X [end(t1 — dn) * (ha(t1) O he (2) ] 


n=0 


sym ° 


The shifting property of convolution (3.16) extends to convolutions between dis- 
tributions of different dimensions in a similar way as the differentiation rule (10.6). 
In particular for the one dimensional convolution fı and the ith dimensional one g; 
we have 


fii — À) * gi(t1,..., Ti) = fi (T1) * giti —A,..., Ti — À). 


Using this property the response of the system can be expressed as 
yo(t1, T2) = h(t, T2) * (x (T1) Q x(T2)) 


N 
~ $ cn [hA Cti) @ hg (T)lsym * (x(t — An) Q x (T2 — Àn)). 
n=0 


This shows that all delays required to approximate hç to any desired accuracy can 
be moved to delays of the input signal as illustrated in Fig. 10.8b. 
If we use a similar approximation for h4 and hg we obtain 
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x | 6(t — A) | 6(t — A) || o(t -A) 


memoryless sub-system composed 


by adders and multipliers p 
Fig. 10.9 Conceptual structure of a WNTI system 
Y2 (T1, T2) 
Ne Ny Na 
x Xo > > Cn, [anbu Jom C = À (na +F Ne)) ® X(T = A (np F nc)) 


nc=0 na=0 np=0 


where we have assumed the use of equal and uniform delays for all sub-systems. 

Monomial systems of higher order can be constructed in a similar way by com- 
bining linear systems and more multipliers. If we approximate all linear sub-systems 
as we did above for the second-order monomial system, it’s easy to see that all delays 
can be moved to the input of the system. A system of order K is the sum of monomial 
sub-systems of order up to K . Therefore, weakly nonlinear systems of finite order can 
be represented as composed by two sections: An input tapped delay line sub-system 
that represents the memory of the system and a memoryless sub-system composed 
by adders and multipliers as illustrated in Fig. 10.9. 

An estimate for the maximum delay necessary to faithfully represent a given 
system of order K can be obtained from the sampling theorem (see Example 12.5): 
If the maximum frequency component of the input signal is fmax, then the highest 
frequency at the output of the system is K fmax and the delay must be bounded by 


À < a 
2 K di max 
The number of taps depends on the amount of memory of the linear sub-systems to 
be approximated. 
The system structure represented in Fig. 10.9 is not the most economical one. 
A comparison between Fig. 10.8a and b reveals that if one moves all the system 
memory to the input of the system then one needs a larger number of multipliers 
than by distributing the memory across sub-systems. This is entirely analogous to 
the trade-off in the implementation of discrete time filters as finite-impulse response 
(FIR) versus infinite-impulse response (IIR) filters. 
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Chapter 11 A) 
Weakly Nonlinear Time Invariant get 
Circuits 


The aim of this chapter is to show the utility of the theory that we developed. This is 
done by applying it to the analysis of nonlinear effects, that is of deviation from linear 
behaviour, in analog circuits. The vast majority of analog circuits are limited by noise 
on the bottom end of their dynamic range and by nonlinear effects on the upper end. 
While the analysis of noise is well understood by practising engineers, the analysis 
of nonlinear effects is much less so, and their minimisation poses great practical 
challenges. The applications presented in this chapter are therefore of practical utility. 

The components serving as the building blocks of analog circuits can be repre- 
sented by linear elements and controlled sources representing nonlinear behaviour. 
The total response of the circuit can be calculated from a hierarchy of electrical net- 
works with the familiar small-signal linear network forming its core. The hierarchy 
of networks is constituted by the linear core driven by sources of increasing order. 
This can be seen as a specialisation to electrical networks of the signal-flow graph 
method that we saw in Sect. 10.2. 

Analog electrical circuits are operated around a stable equilibrium point called 
the (quiescent) operating point of the circuit. The dynamic variables of interest in the 
theory of weakly nonlinear systems are the ones describing the deviation from the 
operating point (see Sect. 9.1). We call such variables small-signal (or incremental) 
variables. In the following, to distinguish the incremental part of a quantity from the 
total quantity, we will adopt the notational conventions summarised in Table 11.1. 

In Sects. 11.2 and 11.3 of this chapter we develop equivalent circuits for electronic 
components allowing us to model arbitrary weakly nonlinear analog circuits. In the 
remaining sections we study concrete circuits used in many types of systems and 
in particular in communication systems. Before that, in the following section we 
review a few standard metrics used to characterise the nonlinear behaviour of weakly 
nonlinear analog circuits. 
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Table 11.1 Definition of symbols used for various quantities 


Definition Quantity Subscript Example 
Total quantity Lower-case Upper-case UC 
Operating point Upper-case Upper-case Vo 
Small-signal quantity Lower-case Lower-case Ue 

kth component of the Lower-case Lower-case and index k Ve.k 


small-signal quantity 


Laplace transform of the kth | Upper-case Lower-case and idex k Vek 
component of the 
small-signal quantity 


11.1 Metrics for Nonlinear Effects 


It’s common to distinguish between two classes of nonlinear effects. The first is 
characterised with input signals of large magnitude, the compression characteristics 
being the archetypal example. The second is characterised using small signals with 
intermodulation as the archetypal example. In the following we analyse these and 
related effects. 


11.1.1 Gain Compression and Expansion 


Gain compression and gain expansion refer to the change in the gain experienced 
by a signal passing through a weakly nonlinear system as the amplitude of the input 
signal changes. At sufficiently large input signal levels all electronic circuits exhibit 
saturation. However, at the onset of deviation of gain from the small signal value, we 
may observe a gradual gain reduction, referred to as gain compression; or some gain 
increase, referred to as gain expansion (see Fig. 11.1). Which of these effects occurs 
and at which signal level depends on the nonlinear characteristics of the system. 


Fig. 11.1 Output signal 


magnitude versus input |A,| (dB) 
signal one of a typical 1 dB i 
weakly nonlinear system 
slope = 1 
pe 


lAlhap  lA;l| (dB) 
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Consider a weakly nonlinear system H driven by a sinusoidal signal 
x(t) = |A;| cos(@;t + 91) = Nfa e] , 


As discussed in Sect. 9.8.2 its output is composed by tones at w; and at integer 
multiples of it, the harmonics. Let’s denote by ye, the sum of all the terms at a, 


Yo, (t) = [Aol cos(ait + Wi) = Rf Age!" 
= yi on H + Y3,0,.90 + 5,0, oe Fars 


=f [fron +3 |Ail? Asa, ats TAT ĥs as + Jaer}, 


From this expression we see that ya, is proportional to the input signal. Therefore, 
in a similar way as we do with LTI systems, we can consider the ratio of the output 
signal phasor to the one of the input signal and obtain a sort of frequency response. 
However, differently from the frequency response of linear systems, the obtained 
ratio is a function of the input signal amplitude and is called the describing function 


Ao 
KUAL oD = 9 = ront |A; faan t> |A;|* fso +- - (1.1) 


Its magnitude is called the gain of the system 


| Aol 
[Ai] 


G(|Ai|,@1) := |K (Ail, | = 


At sufficiently small input signal levels, at the onset of nonlinear behaviour, the 
third order nonlinearity usually dominates and the describing function can be approx- 
imated by 


à ah 
K (AI. o) © hiesy: (14 Ža $02). (11.2) 
hyo.) 


Note that we have factored the linear frequency response to obtain an explicit factor 
representing the deviation of the system’s behaviour from the one of a perfectly linear 
system. This factor can be visualised in the complex plane as the sum of the vector 


= = a hs.a2 
2, +75; : = TIA? SAUD, 
hiron. 


and the unit vector 1 (see Fig. 11.2). If the angle of & is around 0° then the two vectors 
point approximately in the same direction. Therefore, as the amplitude of the input 
signal grows, the magnitude of the output signal grows faster than linearly and the 
system exhibits gain expansion. If the angle of & is around 180° then the two vectors 
point approximately in opposite directions and the system exhibits gain compression. 
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Fig. 11.2 Visualisation of N 
K (Aj| , œ1)/ĥ1,0,1) as the sum 3 73.0.2) | 4 2 
sum in the complex plane of 4 hy,0,1) 

© and the unit vector 


If the angle is around +90° then the vectors are approximately perpendicular and the 
gain of the system is less sensitive to variations of the input signal (terms of order 
higher than third will become important). However, in this case it is the angle of the 
output signal that is sensitive to changes in the input signal magnitude. Such a system 
is said to exhibit amplitude-modulation (AM) tophase-modulation (PM) conversion. 

Let’s have a closer look at the gain of the system. The ratio of the system gain 
to the one of the system if it would be perfectly linear is called the gain compres- 
sion/expansion ratio and denoted by GCER 


GCER = lAl ev (11.3) 


lino. »| 


Using (11.2), at the onset of deviation from linear behaviour, it is given by 


GCER ~ ,/(1+ &,)? + &? 


52 
=(+8)/1+ Gay 


If we expand the square root in a Taylor series 


1 g2 
GCER ~ (1 + &,)(1 + -——— +.. 
(+68 + 5Gyeypt ) 


we see that, to first order, the GCER can be estimated by 


3 
GCER ~x 1+ = a isc, 2) ja; (11.4) 
4 1,(0,1) 


Given our small signal assumption, this expression should only be used to estimate 
gain compression or expansion up to ca. 1 dB. 

A standard linearity metric used to test analog circuits is the 1 dB compression 
point which is the signal magnitude causing the system gain to decrease by 1 dB. 
Equation (11.4) allows estimating the magnitude of the input signal producing a 
given gain compression or expansion 
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4| (GCER — 1) 

3] 5 | hsan ) l 
‘ hio 


If {Áz a, / h 1,(0,1)} is negative the 1 dB compression point can thus be estimated by 


|Aj| © (11.5) 


0.381 
Aan Ss (11.6) 


nf Boa | 
h1,0,1) 

For small input signals the phase change can also be calculated from the ratio 
K (\Ao| , @1)//1,0,1) 


T 
Gj AGA 
Eh 


? eai Ry 
= = 
1+6, 1+464, 


i- 


Aw, = arctan 


From this we can estimate the input signal magnitude producing a phase change of 
Avy; radiants by 


4) A 
[Aj] © |: (11.7) 
3 sfa] 
hi0, 


11.1.2 Intermodulation 


In Example 9.8 we analyzed the response of a weakly nonlinear system to a two tones 
input signal and found that it is composed by several tones at various frequencies. 
In the context of communication systems and analog circuit design all signal tones 
at a frequency that is not a multiple of one of the input frequencies are referred to 
asintermodulation products. An intermodulation product is said to be of order k and 
denoted by IMk if k is the lowest order nonlinearity able to produce it (see Fig. 9.8). 
For example, given input tones @; and œ, the tones at 2m; — œ and 22 — w]; are 
intermodulation products of third order (IM3); the ones at 3m, — 2m and 3w2 — 2a 
of fifth order (IMS). 

As an example showing the importance of controlling and limiting the strength of 
intermodulation products, consider a communication receiver designed for a specific 
service. Most communication services divide the allocated frequency band in equally 
spaced channels. Suppose that we are interested in receiving a signal transmitted by 
a distant transmitter on channel j. Suppose further that the receiver also receives 
relatively strong interfering signals on channels j + m and j + 2m destined to other 
users. If the receiver is not sufficiently linear then the two interfering signals will 
produce intermodulation products degrading and possibly completely masking the 
wanted signal. While the modulation of the involved signals plays a role, due to its 
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Fig. 11.3 Interfering signals Interferers 
causing the IM3 product to 
mask the wanted signal A | Ff{eva)}1 lAl 
4 [Aol 
f IM3 
IM2 > . 
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simplicity, communication receivers are also invariably benchmarked and tested with 


tones as shown in Fig. 11.3. 
Consider the two tones input signal 


x(t) = |Aj| cos(@ + p1) + |A2| cos(w2 + p2) = H{Are! + Aze} 


where we assume œw > @; > 0. The intermodulation product of order k characterised 


by the frequency mix m is 
E 1 k! mgn- amann ont 
Ykm) = i p HA A A, Az “hy me®'}. 


At relatively low input signal levels the strongest intermodulation products are 
the second and the third order ones with amplitudes 


Amaz = |¥3,¢0,1,0,1 ©) | = |41] |A2l a,0.1.0.0| 


Amz *= |¥2,0,0,1,4) | = [Ail |Aal zoon] 


3 à 
Amaz := |Y5,0,2,0,) | = Z |Ail? [Al lis.0.20.0| 


3 ale 
Av3H ‘= Y3,0,1,0,2) É) = A |A| |42] lâs.o0] : 


These expressions show that the IM2 products are proportional to the amplitudes 
of each of the two input tones while the IM3 products are proportional to the square 
of the magnitude of the closest tone and proportional to the magnitude of the more 


distant one (see Fig. 11.3). 
The standard intermodulation test is performed with two tones of equal amplitude 


|Ai| = |A2| =A. 
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Fig. 11.4 Second and third Pout(Om) Å 
order intermodulation dB) 4 ` OIP2 
intercept points (dB) 


1 dB 


Fund. 
(slope = 1) 


IM3 IM2 
(slope = 3) (slope = 2) 


T T _ 
IP3 mp2 P: (dB) 


In this case the magnitude of the IM product of order k is proportional to A‘ (remem- 
ber that |m| = k) 


Thus knowing the IMk product level at one value of A is enough to compute its 
value at a different value of A. This is of course only true at sufficiently small input 
signals, when the contributions to the IMk product of nonlinearities of order higher 
than k can be neglected. Instead of specifying the IMk at a specific value of A it 
is common practice to specify the intermodulation intercept point of order k (IPk). 
This is the level, extrapolated from sufficiently small values of A, at which the IMk 
reaches the same magnitude as the (linear) output of the system at w,, when driven 
by a single tone of magnitude A and frequency œm (see Fig. 11.4). The kth order 
intercept point is thus defined by the equation 


lk, 
lal 


hkm 


=Alin(on)| . 


Solving for the amplitude we find 


AyrpK A : (11.8) 


This quantity is also called the input referred IPk and denoted by IIPk. Sometimes it 
is more convenient to refer this quantity to the output of the circuit in which case it is 
called the output referred IPk and denoted by OIPK. Its value is found by multiplying 
the IIPk by the linear gain at w,, 
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Aork := fâi (on) Alp - (11.9) 


The second and third order intercept points are the most important ones and can 
be estimated by 


h m 
Am = Or) (11.10) 
2,m 
4| hy (Om 
Aup3 = ; Le, (11.11) 
3 h3m 


Expressed in decibels the IPk assumes a particularly simple form. To that end, 
let’s first rewrite the output referred IP k as 


Aor = [hi (on) “fmt A atom 
IPk — 1 m 


k! Ak him 
A fiton) 
= Ahi (@m)| | ———. 
Ak A lk, m 
Then note that 7 3 
(4 [fiom] 


is the output power of the fundamental tone normalised to a load of 1/2 Q. Similarly, 


(AT ln) 


is the one of the IMk product. Thus, if for a fixed and sufficiently small value of A 
we denote by P, the output power of the fundamental expressed in dB relative to 
some reference power and by Pim the one of the IMk product relative to the same 
reference level, then we can express the OIPk by 


hkm 


P,— P, 
OIPk = P, + Are (11.12) 


Similarly, by denoting the normalised power of an input tone by P,, the IHPk can 


be expressed by me 
IIPk = P, + i (11.13) 


These relationships are easily checked geometrically for the IP2 and IP3 in Fig. 11.4. 
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In memory-less weakly nonlinear systems for which, for every k, him is a real 
number c independent of m, the IP3 and the input signal level producing a gain 


compression/expansion of GCER are both proportional to (see (11.5)) 


c1 


C3 


Therefore, in this type of systems, these two quantities are proportional to each other 


20108 (ZE B) 20 log /|GCER — 


For a memory-less system exhibiting gain compression, the difference between the 
IP3 and the 1 dB compression point is 


A 
20 log (72) = 20 log V 1 — 1071/20 ~ —9.6 dB. 


TIP3 


11.1.3 Desensitisation 


The response of an LTI system to a signal is unaffected by the presence of a second 
signal. As long as we have a way of distinguishing the two signals, for example by 
separating them in frequency, we can ignore the presence of the second one. This is 
not the case in nonlinear systems where the response to one signal is affected by the 
presence of other ones. The effect is again most easily illustrated using a two tones 
input signal. 

Let H be a weakly nonlinear system driven by the two tones input signal 


x(t) = |Aj| cos(@ + p1) + |A2| cos(@2 + p2) = MA e! + Aze}. 


The first tone represents the signal of interest, while the second one is an undesired 
signal that is referred to as a blocking signal or a jammer. As discussed, the response 
of the system is composed by several tones at various frequencies, among which 
several at œw. As in Sect. 11.1.1, we denote by Ye, the sum of all terms at w1 


Yo, (t) = [Aol cos(@it + Vi) = RAe} 
= yi 0,0,1,0) É) + ¥3,0,1,2,0 4) + 5,0,0,1, A Saas 


-a| [hono |A? sono +5 |421? 3,1,0,1, + Jaee), 


and find again an expression that is proportional to the phasor of the first input tone. 
At relatively small input signal levels the contributions of order higher than third 
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can usually be neglected. In addition, we assume that the magnitude of the blocking 
signal is much larger than the one of the desired signal 


|Ai| < |A2| . 


Under these assumptions ye, can be simplified to 


AN 3 R 
Yo (t) X [foo + 2 JA21? faon Jae] : 


Following a procedure similar to the one that we used to analyse gain compression 
and expansion, we build the ratio of the output phasor to the one of the first tone 


te i 3 hsaon, 
XM (\A2|, @1) := T = hoo) «(1+ 5 142? aD) 
1 


1, (0,0,1,0) 


to obtain a sort of frequency response. Similarly to the approximation of the describ- 
ing function (11.2), it is the product of the linear frequency response of the system 
and a factor that characterises the deviation from linear behaviour. Differently from 
the describing function, however, this second factor depends on the amplitude of the 
second tone, the blocking signal. 

The ratio X M(|A2| , @1) /h1,0,0,1.0) can again be visualised in the complex plane 
as the sum of the unit vector and the vector 


A 


3 > aaoi 
z |A| a. 
hı (0,0,1,0) 


If the angle of the latter is close to 180° then the second tone will induce a reduction 
in the gain experienced by the first one. If the angle is close to 0° it will induce a 
gain expansion and, if the angle is close to +90° it will induce mostly a change in 
the phase of the first tone. The change in gain can be characterised by the magnitude 
of the above ratio, the desensitisation ratio 


XM(|A2l, 3 h 
pR= | a(i 21, @)} _ j +inf aiana (11.14) 
h1,,0,1.0) 2 1,(0,0,1,0) 
and to second order in |A2| can be estimated by 
3 h 
DRY 145 |an? n| at | (11.15) 


1,(0,0,1,0) 


From this expression we can estimate the magnitude of the blocker causing a certain 
wanted signal gain change 
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(DR — 1) 
A fs,a.o1.0, pe 
h1,0.0,1.0) 


If R{A3,1,0,1.1)/h1.0,0,1,0} is negative, a desensitisation of 1dB is produced by a 
blocker at the 1 dB blocking level 


|A2| © (11.16) 


0.269 
gf fuor hs, 13,0,0.1.1) | | 
hi 0.0.1.0 
The change in phase of the first tone caused by the presence of the blocker can 


also be estimated from X M (|A2| , @1)/h1,(,0,1.0). To first order a phase change of 
Aw; radiants is produced by a blocker of magnitude 


ABiaB = (11.17) 


Ayı 
sf pan | ` 
h1,(0,0,1,0) 
Note that if the blocker is modulated, then the modulation will be transferred 
from it to the wanted signal. For example, if the blocker is amplitude modulated 
(AM) and the angle of /3 (1,0,1,1)//1,(0,0,1,0) is close to either 180°or 0° then the gain 
experienced by the wanted signal is modulated and, as a result, its output amplitude 
will also be modulated. If the angle of /3 (1,0,1,1)//1,(,0,1,0) is close to 90° then an 
amplitude modulation of the blocker will produce a phase modulation of the wanted 


signal. This effect of transferring the modulation of one signal to another one is called 
cross-modulation. 


[A © (11.18) 


2 
3 


11.2 Nonlinear Two-Terminal Elements 


In this section we investigate two-terminal electrical components that can be char- 
acterised by two quantities xg and yg, related by an equation of the form 


fxe, yz) = 0 


with f a function called the element x-y characteristic (see Fig. 11.5). If the equation 
can be expressed as a function of xg, yg = f (xg) then the element is called anx- 
controlled device. Similarly, if it can be expressed as a function of yz, xg = f Oe) 
then it is called a y-controlled device. 
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an x-controlled two-terminal 
element 
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The devices that interest us are the ones that, in a region of interest around a 
quiescent operating point (Xg, Yg), are either x- or y-controlled and whose function 
f can be approximated to any desired accuracy by a power series 


[o0] 
J= 5 frx*  (x-controlled) 
k=1 
or 
[o0] 
Xe = >. fxys — (y-controlled) 
k=l 
with 
Ye = YE — YẸ, Xe = XE — XE. 


11.2.1 Nonlinear Resistors 


A nonlinear resistor is a device characterised by the current ir flowing through it, 
the voltage vr across its terminals and by an i-v characteristic fr(irz, vr) = 0. In 
the following we are going to represent a nonlinear resistor by the symbol shown in 
Fig. 11.6. A current controlled resistor can be characterised by a function 


Ur = r(ir) 


which, by assumption, around the operating point (Ir, Vr), can be approximated by 
a power series 


po 
v=) ri, = V=V,=UR—Ver, i=ir=irz—Tp. 
k=1 
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Fig. 11.6 Symbols used to a ; b 7 : c 
represent nonlinear IR IC IL 


two-terminal devices. a 
resistor. b capacitor. € UR —| | vc UL 
inductor Peps 


If we consider the current i and the voltage v as signals, or, more precisely, elements 
of De it then a nonlinear resistor can be regarded as a weakly nonlinear system 
and the components of v can be expressed in terms of the ones of 7 using (10.1) and 


Table 10.1 


v =ni 


; .&2 
v2 = rıl2 + ni? 


l S : (11.19) 
v3 = 7113 + 2r [ii ® i2|sym + 130) 


From this representation we observe that each voltage component vx is determined 
(i) by a term proportional to the kth current component i, and (ii) by other terms 
proportional to current components of order lower than k. In an electric network, the 
former can be represented by a linear resistor of value r,, the latter by a voltage source 
Upr,« Whose value is determined by the current components i,,n = 1,...,k — 1 (see 
Fig. 11.7a) 

vk = riik + UR KC, + ik—1) - 


The various current and voltage components can therefore be calculated using 
a hierarchy of linear networks. First, we find the linear current 7, using linearised 
components and the sources representing the system input. Once i; is found, ÙR,2 
can be determined. With it we can draw the second order network. It is obtained 
from the linearised network by removing the system input sources (since they are of 
first order), by adding the second order source Ŭg,2 and, if the case, the ones of other 
nonlinear components. With this network we compute i2. Having found the first two 
components of i, the third order source vr.3 can be calculated. We then proceed to 
draw the third order network which is again composed by the linearised network 
with the addition of independent sources of third order only. With it, we find i; and 
so on. 

If the nonlinear resistor is voltage controlled, then its characteristic around the 
operating point can be described by a power series where the role of the independent 
variable is played by the voltage v 


[o0] 
i=) go. 


k=1 
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Fig. 11.7 a Weakly nonlinear resistor current-controlled equivalent model b Weakly nonlinear 
resistor voltage-controlled equivalent model 


Proceeding as for the case of a current controlled nonlinear resistor, but with the 
roles of the signals i and v exchanged, we can express the first few components of 
the current i in terms of the ones of the voltage 


i) = gv] 


in = gi v2 + gov?” 
2 = give + govi m (11.20) 
13 = 8103 + 2g2 [vi ® V2 ]sym F 8301 


As before, each current component i; is the sum of a term linear in v; and other terms 
only depending on components of v of order lower than k 


ig = give HİR k(VI, ..., Ue-1)- 


From this representation we deduce the equivalent circuit shown in Fig. 11.7b. 

If a nonlinear resistor is voltage as well as current controlled, then we can choose 
the most convenient representation for the problem at hand. If one representation is 
known, then the other one can be obtained by power series inversion. For example, if 
we know the voltage-controlled representation, the current-controlled one is obtained 
by inserting the expression for the components given by (11.20) into (11.19) and by 
choosing the coefficients rg so that the equations are satisfied. Specifically, rı is found 
by solving 

i = giv = gin 


which gives 


ry = —. 


r is obtained by solving 


‘ @2 . +@2 2.82 
ig = g1V2 + gov, = gı (rii2 + rif ) + grii - 
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Using the previously obtained value for 7, the equation is satisfied if 
2 _ 
iro + gor; =0 


or 


r = 
2= 7G 
gi 
r3 is found in a similar way to be 
283 — 8183 
r3 = 5 s 
8} 


Higher order coefficients are easily calculated using the same procedure. 


11.2.2 Nonlinear Capacitors 


A nonlinear capacitor is a two-terminal device whose voltage vc across the terminals 
and the charge qc stored in it are related by a q-v characteristic fe(qc, vc) = 0. In 
the following, we are going to represent a nonlinear capacitor by the symbol shown in 
Fig. 11.6. A voltage controlled capacitor is a capacitor whose charge is a function of 
the voltage gc = fe (vc). Since the electric current is the time derivative of electric 
charge, if the voltage uc is a differentiable function of time, the capacitor current is 
related to the voltage across its terminals by 


_ dfewo) dve 


e= vc dr 


The slope of the q-v characteristic is called the small signal (or incremental) capac- 
itance of the nonlinear capacitor 


d fc(vc) i 


C(vc) := do 
c 


As before, we assume it to be expandable in a power series around the operating 
point (Qc, Vc) 


(oe) 


c(v) := C(u+ Vo) = > cepiv®, v=vc— Ve. 
k=0 


Using this expression in the equation for the current, we can express the latter as the 
following power series 
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[0,0] 


This last expression can be extended to currents and voltages represented by elements 
of D yar so that we take it as defining the relationship between current and voltage 
of a voltage-controlled weakly-nonlinear capacitor 


oe) 
icira) > Du. (11.21) 
k=1 


The first few components of the current expressed in terms of the components of the 
voltage are given by 


ii = c Dv; 


i2 = cı Dv + 2 Dy?? 
2 (11.22) 


3 123 
13 = cı Dú + & D [v ® V2lsym + zP" 


Each current component i, is the sum of a term linear in Dv, and others that only 
depend on the voltage components of order lower than k. In an electric network the 
kth component of the current can therefore be represented by a linear capacitor of 
value cı and a current source (see Fig. 11.8b) 


ig = c1 Dug Fickle Vki) 


The various components are calculated with the same hierarchy of networks that we 
described for nonlinear resistors. 

An initial charge qo on the capacitor can be represented as usual by a current pulse 
qoô applied across the capacitor in the linear convolution equation. 

Note that the linearised current-voltage characteristic of a capacitor is not by 
itself an asymptotically stable differential equation. The nonlinear transfer function 
formalism is therefore only applicable when the nonlinear capacitor is embedded in 
a network whose linear approximation is asymptotically stable. 

A charge controlled nonlinear capacitor is a capacitor whose voltage is a func- 
tion of the charge vc = ¢ (qc). Expanding this function around the operating point 
(Qc, Vc) we obtain 


[e6] 
v=} s4", v=vc-Vc, q=qc-Qc. 
k=1 
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The electric charge is the integral of the current. In the convolution algebra of right 
sided distributions this can be expressed by the convolution product between current 
and the Heaviside step function 


q(t) = f i(t)dt =1,(t) *i(t). 
0 


Substituting this equation in the preceding series we obtain a relation between current 
and voltage 


[e0] 
v=} ali. 
k=1 


The first few voltage components expressed as a function of the current components 
are given by 


v = 6114 * i 


v = 6114 * iz + oo(1y * i1)? 


v3 = çı 14 * i3 + 522[(14 * i1) O (14 * i2) J ym + 63(14. * i)e 


As for the previous cases we see that each voltage component vg is composed by a 
term linear in the current i, and other ones that only depend on current components 
of order lower than k 


vk = S114 * ik + VC KC, +, ik—1). 


This expression can be represented in an electric network by the equivalent circuit 


shown in Fig. 11.8a. 
If a capacitor is voltage controlled as well as charge controlled, then one can 


use either representation and one can be converted in the other one. The following 


Yk b i 
= c= + k 


Uk 7 Sı 
es : : D nG i Ul, .. -, Uk—1 
C) l Sox(i,..-sik-1)  * i € cxl k-1) 


Ò Ò 


Fig. 11.8 a Weakly nonlinear capacitor current-controlled equivalent model b Weakly nonlinear 
capacitor charge-controlled equivalent model 
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equations give the first three coefficients of the charge controlled representation 
expressed in terms of the ones of the voltage controlled ones 


1 
çı = — 
Cl 
C2 
CoS, 
26 
c C3 
G3 = — m -7Zz. 
2e Bei 


They were obtained by the same inversion procedure that we used to relate the two 
representations of nonlinear resistors. 


11.2.3 Nonlinear Inductors 


A nonlinear inductor is a two-terminal device whose current iz and magnetic flux 
$L are related by the ¢-i characteristic fz(Lz,iŁ)= 0. In the following we are 
going to represent a nonlinear inductor by the symbol shown in Fig. 11.6. A current 
controlled inductor is an inductor whose flux is a function of the current ø; = fi (iz). 
The voltage across the terminals of an inductor is the time derivative of the flux. Thus, 
if the current is a differentiable function of time, the voltage is 


5, a li) diz 
LO di, dt” 


The slope of the -i characteristic is called the small signal (or incremental) Induc- 
tance of the inductor 7 
didi 
Liiz) := afr) (11.23) 
di L 
that we assume, around the quiescent operating point (®;, Iz), to be expandable in 
a power series 


[0,6] 
li) := L(i + I) = X leni", i=i— h. 
k=0 


It is apparent that inductors and capacitors are “dual” of each other, with the roles 
of current and voltage exchanged. We can therefore adapt previous results and define 
the voltage and current relationship of a current controlled weakly nonlinear inductor 
by 


l 
v=vwc =} 7 Di". (11.24) 
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Fig. 11.9 a Weakly nonlinear inductor current-controlled equivalent model b Weakly nonlinear 
inductor flux-controlled equivalent model 


The first few components of the voltage expressed in terms of the components of 
the current are given by 


v1 = l Dii 
l 
v = lı Dis + D 


(11.25) 


á . i b .&3 
v = l4 Di; + LD [i ® i2]sym + aot 


The component k has the form 
Ug = lı Dug + ULC, «++ 5 tk-1) 


from which we read the equivalent circuit shown in Fig. 11.9a. 
Similarly, the flux controlled representation of a weakly nonlinear inductor is 


[0,6] 
i= oles)", 
k=1 


with the first few voltage components expressed as a function of the current compo- 
nents given by 


ly = Q1ty * V1 
in = Q114 * v2 + 00(14 * v1)? 


is = Q114 * v3 + 022 [14 * v1) @ (14 * va) ] oym + 0314 * 0) 
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Fig. 11.10 Current source representation in the Laplace domain of nonlinearities of weakly non- 


linear elements 
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Fig. 11.11 Voltage source representation in the Laplace domain of nonlinearities of weakly non- 
linear elements 


The component k has the form 
ik = 0114 * Ue FIL elI., Vk). 


which leads to the equivalent circuit shown in Fig. 11.9b. 

As for nonlinear capacitors, the nonlinear impulse responses formalism can only 
be applied to circuits including nonlinear inductors when they are part of networks 
whose linear approximation is asymptotically stable. 
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11.3 Nonlinear Multi-port Elements 


Weakly nonlinear multi-port and multi-terminal elements can be represented by two- 
terminal elements and controlled sources. Therefore, in this section we introduce 
weakly nonlinear controlled sources. With them, we will have at disposal all the 
necessary circuit elements necessary to model arbitrary weakly nonlinear electronic 
components. 

A controlled source is a two terminal element whose voltage vcs or current ics 
is controlled by a control voltage vx or current ix, a quantity in another part of the 
electric network of which it is part. There are four types of controlled sources: 


e the voltage-controlled voltage source (VCVS), characterised by the equation 
vcs = u (vx); 

e the voltage-controlled current source (VCCS), characterised by the equation ics = 
8m (vx ) 3 

e thecurrent-controlled voltage source (CCVS), characterised by the equation vcs = 
rm (ix), and 

e the current-controlled current source (CCCS), characterised by the equation ics = 


As before, we assume that, around a quiescent operating point, the characterising 
function can be approximated to any desired accuracy by a power series. The incre- 
mental quantities can then be represented by elements of Dis sont For example, we 
assume that a VCCS can be represented by 


CO 

; k z A 

i=% EmkV” , i =ics — lcs, v= vyx- Vx. 
k=l 


The first three components of the current expressed in terms of the components 
of the voltage can be derived in the same way as we did for a voltage-controlled 
weakly-nonlinear resistor and are 


i) = 8m1 Yı 

i2 = gm1 V2 + 8m0” 

s &m1V2 T §m2V1 5 (11.26) 
i3 = 8m1U3 + 28m2 [v1 8 Val sym + 8m3] 


Note that each current component i; is the sum of a term linear in the kth component 
of the incremental control voltage vg and other terms that only depend on components 
of the voltage v of order lower than k 


ik = 8mkVk + İcs,k(V1, ..., Ue-1)- 
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Fig. 11.12 Equivalent O i 
ik 


circuit of a weakly-nonlinear 
Uk a 
© 8m1Vk © İR,k(V1, . - +» Uk-1) 


VCCS 
O R 


In an electric network a VCCS can thus be represented by a linear VCCS and indepen- 
dent current sources only depending on control voltage components of order lower 
than k. As for the two terminal weakly-nonlinear elements considered in Sect. 11.2, 
the linear term of a VCCS plays a special role. For this reason the quantity 8m1 
has been given a name: it is called the transconductance of the source. A two-port 
representation of a VCCS is shown in Fig. 11.12. 

The situation is entirely analogous for the other types of controlled sources. The 
coefficient of the linear term of a CCVS rm1 is called the transresistance, the one of 
a CCCS g; is called the current transfer ratio and the one of a VCVS u: the voltage 
transfer ratio. 


11.4 Low-Pass Filter with Nonlinear Capacitor 


In this section we investigate the low-pass filter (LPF) shown in Fig. 11.13a. When 
implemented in an integrated circuit technology, a considerable fraction of the circuit 
area is often occupied by the capacitor. Given that the price of integrated circuits is 
determined to a large extent by occupied area, to reduce the cost of the circuit, 
it is desirable to use a capacitor type with a high capacitance per unit area. The 
highest capacitance per unit area available in CMOS technologies is offered by MOS 
capacitors which however have a rather nonlinear characteristic. For this reason we 
investigate the effects introduced in the circuit by the use of a nonlinear capacitor. In 
particular, we are interested in the upper linearity limit set by the nonlinear capacitor 
and therefore assume the operational amplifier (OpAmp) to be ideal. 


11.4.1 Nonlinear Transfer Functions 


Under the assumption of an ideal OpAmp, the circuit of Fig. 11.13a can be represented 
by the small-signal equivalent circuit shown in Fig. 11.13b. Since MOS capacitors 
are voltage controlled, we represent the nonlinear capacitor as a voltage controlled 
device. Then, using Kirchhoff’s current law (KCL) the system equation is 


218 11 Weakly Nonlinear Time Invariant Circuits 


TORR D eE 


a = coo) |oo 


Fig. 11.13 a Active RC low-pass filter circuit b Active RC low-pass filter with ideal OpAmp 
model 


[o0] 
7 +c1Dv, = -i — > + Dot (11.27) 
k=2 
To highlight that the nonlinear part of the capacitor characteristic act as a source, 
we have moved that part of the characteristic to the right-hand side of the equation 
together with the source i, and collected all linear terms on the left-hand side. 

We solve the equation in the Laplace domain using the equivalent circuits that we 
developed in Sects. 11.2 and 11.3. The first order output voltage component V3.1 is 
obtained by replacing the nonlinear capacitor with an ideal capacitor with a value cı 
corresponding to the value of the nonlinear capacitor at the operating point 


Vo,1(81) 


R + cisı Vo 1 (51) = —1; (s1). 


Using a Dirac impulse as input signal, the first order transfer function is found to be 


Hi (s1) =.V, a 
181) = . Vol E 
mast roe 
with 
1 
w = ae 
3dB Rc 


the 3 dB cut-off frequency of the filter. 

Having found the first order output component of the voltage, we can calculate 
the equivalent source representing the second order nonlinearity of the capacitor 
AGE 52) (see Fig. 11.10). With a Dirac pulse as the first order input we find 


~ E 
Ie 2(81, 82) = z6 + s2)Hı (s1) Hı (s2). 
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Fig. 11.14 LPF 2nd order + 
equivalent circuit mite. a 
Vo,2(81, 52) R Se G) Ic,2(S1, $2) 


The second order transfer function is found with the help of the second order 
equivalent circuit. It is obtained from the first order one by removing the current 
source [,(s;), which is of first order, and by inserting the source 1.9(s, S2) repre- 
senting the second order nonlinearity of the capacitor. The second order equivalent 
circuit is shown in Fig. 11.14. Note how the current generated by second (and higher) 
order nonlinearity is injected into the input node of the filter. For this reason, as dis- 
cussed in Sect. 10.2, feedback is unable to suppress it. Since the variables in this 
network are of second order, we have to use the definition of the derivative for sec- 
ond order distributions (Eq. (9.14)). The second order transfer function is thus found 


to be 7 
H: ’ = SAE ’ 
2(81, $2) fet eRe 2(81, 82) 
c 
= z6 + $2) Ay (sı + s2)Hı (s1) Hı (s2). 


With the first two components of the output voltage we can compute the equivalent 
source representing the capacitor nonlinearity of third order (see Fig. 11.10) 


Ie 3 (S1, 82, 83) 


c3 

= (s1 + s2 + s3){c2 [H1 (s1) H2(s2, S3)]sym + zM (81) Hi (s2)H1 (s3)} 
and with it the equivalent circuit of third order. Using the definition of the derivative 
for third order distributions, the third order transfer function is found to be 

Hy(s1, 82, 83) = bta 
S ’ tad = Cc ’ 7 S 
3(S1, 82, 83 1464s ERG 3 (S1, 52, 83 
= (sı + s2 + s3)Hı (s1 + s2 + 583) 


2: 
(Z [H1 61) Hi (52) Hi (53) 52 + 83) Hi (82 + 53) leym 


c3 
$ zM (91) Hı (s2) Hi (s3)} . 


220 11 Weakly Nonlinear Time Invariant Circuits 


11.4.2 Second Order Intermodulation 


Having found the first three transfer functions of the filter, we can evaluate the impact 
of the nonlinearities in concrete situations. As a first situation, suppose that there is a 
strong modulated signal in the stop-band of the filter. If the even order nonlinearities 
generate strong IM products masking the wanted signal in the pass-band, then the 
filter is of little use. To have a first indication of the strength of this effect, we 
only consider the nonlinearity of second order and calculate the IP2. We model the 
modulated signal in the stop-band with two tones at w and w2. We further assume 
@2 > wı and 
AW := W — @| < @34B 


so that one of the IM2 products falls in the pass-band of the filter. The IM2 of interest 
is characterized by the frequency mix m = (0, 1, 0, 1) and thus by the frequency 
response 


C2 
H(— jwi, Jæ) = 3740H G Aw) Mi (— jor) Ai (Go) 


c R wag Ræ c Ro? 
5 Aot 3dB 3dB AG 308 
2 —jJa@, Jor 2 QO} 


With it the IIP2 and OIP2 are obtained from (11.10) 


tele = ( a y (11.28) 
all C2 Aw R2 @3dB : 
2 w] 2 
Vop ad (==) l 11.29 
OIP2 TAR | m ( ) 


We have denoted the two intercept points by Jjjp2 and Vojp2 to make it clear that the 
first characterizes the magnitude of the input current while the latter the magnitude 
of the output voltage. 

These expressions reveal that the more the blocker is in the stop band, the lower 
the IM2. This makes intuitive sense as the voltage generated across the nonlinear 
capacitor by the interfering signal is the smaller, the lower the capacitor impedance. 
Since we have assumed a voltage-controlled capacitor, a small voltage will produce 
small intermodulation products. The above expressions also reveal that the IM2 is 
not homogeneous across the pass-band, but it is stronger when Aw approaches the 
filter 3 dB cut-off frequency of the filter. 

The IP2 can also be expressed in a slightly different form. If we replace one 
occurrence of wag by 1/(c; R) in the above expression we obtain 


Ci or 


Vor © 2 (11.30) 


co | Aw wsap 
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Fig. 11.15 a Typical characteristic of an n-type accumulation-mode MOS varactor with a channel 
length of 0.2 um in a 40nm CMOS technology b Small-signal model coefficients of an n-type 
accumulation-mode MOS varactor with a channel length of 0.2 um in a 40nm CMOS technology 


This form highlights the value of the OIP2 as a function of the ratio of the linear 
capacitor coefficient to the coefficient of the second order nonlinearity. 

Figure 11.15a shows the typical characteristic of an n-type accumulation-mode 
MOS varactor [26] with a channel length of 0.2 um in a 40nm CMOS technology. 
Figure 11.15b shows the small-signal model coefficients normalized to the linear 
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Fig. 11.16 Simulated IM2 of the LPF with the capacitor having the characteristic shown in 
Fig. 11.15a and driven by two tones of equal magnitude at 48.75 and 51.25 MHz respectively 


capacitance cı as a function of the voltage across the capacitor. If we use such a 
capacitor biased at 0 V to implement a LPF with a cut-off frequency of 5 MHz to 
suppress a signal at 50 MHz modeled as two tones at 48.75 and 51.25 MHz respec- 
tively, (11.30) predicts that the filter will have an OIP2 of 


OIP2 ~ 44 dBV. 


For comparison we simulated the LPF IP2 numerically. To obtain a cut-off frequency 
of 5 MHz we used a resistor of 1 kQ anda nominal capacitance of 31.58 pF. The results 
of the simulation are shown in Fig. 11.16. The value of the IP2 agrees very well with 
the predicted value. The IM2 starts to deviate from the ideal slope of 2 at a level 
of the input tones of ca. -55 dBA. This means that at that level the contribution of 
higher order nonlinearities to the IM2 become important. A -55 dBA tone at 50 MHz 
passing through a linear LPF with a transfer function equal to H; (Jœ) and the above 
component values produces an output tone with a magnitude of approximately 


/2.10755/20 R PB y 951 mV. 
w 


The capacitor characteristic in Fig. 11.15a shows that a linear c-v approximation 
is only reasonably accurate up to this value. We thus see that a rough estimate 
of the range of validity of the approximation can be obtained by overlapping the 
approximation with the real characteristic. At larger positive and negative voltage 
levels the capacitor characteristic flattens out, and we can speculate that this is the 
reason for the slower increase of the IM2 at large signal levels. 
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Fig. 11.17 a LPF with back-to-back capacitors ideal OpAmp model b LPF with back-to-back 
capacitors 2nd order equivalent circuit 
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Fig. 11.18 Characteristic of two back-to-back n-type accumulation-mode MOS varactor each with 
the characteristic shown in Fig. 11.15a 


For many applications, such as in communication receivers, this IP2 is insufficient. 
One way to improve it is by using two equal nonlinear capacitors connected back- 
to-back as shown in Fig. 11.17a, each providing half of the required capacitance. In 
this way, when v, increases, the capacitance of one capacitor increases, while the 
one of the other capacitor decreases. 

One way to analyze this circuit is to consider the combination of the two capacitors 
as a single nonlinear capacitor with the effective characteristic shown in Fig. 11.18. 
Figure 11.19 shows that, with identical devices, cz is identically zero. Hence, the IM2 
is completely suppressed. 

Another way to analyze the circuit is to consider each capacitor individually. If the 
linear transfer function has to remain the same as the one of the original circuit with 
a single capacitor, then we must have c, 1 + Cy,; = c1. The second order network is 
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Fig. 11.19 Small-signal model coefficients of two back-to-back n-type accumulation-mode MOS 
varactor 


therefore composed by the same linear components as before, but now it includes two 
sources, each representing the second-order nonlinearity of one of the two capacitors 
(see Fig. 11.17b). The one of c, has the same reference direction as the one of the 
original circuit and has a value of 


x c 
I.,.2(s1, 2) = S + $2)Vo,1(81) Vo,1 (82) - 


The one of c, has the opposite reference direction and a value of 


> Cn,2 
Ie, 2(81, $2) = 5 (s1 + 52)(—Vo,1(s1))(—Vo, 1 (82)) 
Cn,2 
=a (s1 + 52) Vo,1 (81) Vo,1 (82) - 
As the two negative signs coming from v, = —v, cancel, the two currents flow in 


opposite directions and, if c,,2 = C)p,2 they cancel each other. 

Note that this cancelling effect of even order responses is quite general. Given an 
arbitrary even order frequency response him: the response of (even) order k to N 
input tones with phasors Aj, ..., An is 


1 k! m m m my f 
Yim) = nf 2k=1 AN i A_i Ai bpm Ay” fkm g) : 
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If the sign of all input tones is reversed, every phasor will be multiplied by e/”. As 
k is assumed to be even and |m| = k these factors will multiply to 1 


e77% = |: k even. 


For this reason the response remains unchanged, but with opposite reference direc- 
tion. Therefore, all even order harmonics and intermodulation products will be sup- 
pressed. 

In reality there are two limitations to the amount of canceling that is practically 
achievable. The first one is due to the fact that small unavoidable manufacturing 
imperfections make nominally identical devices slightly different. This effectis called 
mismatch. For this reason the coefficients of c, will be slightly different from the 
one of c,. Let’s represent the small variations due to mismatch in the following way 


Cp,1 = Cnom,1 + ACp,1 Cn,1 = Cnom,1 + ACn, 
Cp,2 = Cnom,2 + ACp,2 Cn,2 = Cnom,2 + ACn,2 
with 
cl 
Cnom,1 = z ACp,1, Ach] < Cnom,1 
C2 
Cnom,2 = z Acp2; Acn2 < Cnom,2 - 


Then, the two current sources 1.,.2(81, s2) and I,,.2(S1, s2) can be represented by a 
single source with the same reference direction of the former and a value of 


Acp,2 = Acn.2 


5 (81 + $2) Vo,1 (81) Vo,1 (82). 


I.,,2(81, 82) — Te,,2(81, 52) = 


The resulting network is similar to the one of the original circuit, the only difference 
being that the coefficient cz is replaced by Acp,2 — Acn,2. The OIP2 is therefore 


cı + Acy 1 + Acy1 
Acp,2 = Acn.2 


2 
al 


Vorr2,B28 © 2 


AW @3aB8 


C2 C1 w? 


~2| (11.31) 


ACp2 — Acn2 C2 AOW W3aB 


Vor? - 


| z 


ACp,2 = Acn2 


Compared to the original circuit the IP2 has been improved by the mismatch limited 
factor 
C2 
Ac p,2 T ACn,2 
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Fig. 11.20 Simulated IM2 of the LPF with the capacitor having the characteristic shown in 
Fig. 11.15a and driven by two tones of equal magnitude at 48.75 and 51.25 MHz respectively 


Figure 11.20 shows the results of simulations with two identical nonlinear capaci- 
tors, and for the case where ACp,2/Cnom,2 = —ACn,2/Cnom,2 = 0.01. In the latter case 
we observe the expected improvement of 


1 
z 201og( =) = 40 dB. 


ahog ( 0.01 


C32 ) 
0.02c2/2 


In the former, at input signal levels up to —65 dBA the value of the IM2 is limited by 
numerical noise. At larger input signal levels the simulation result is the product of 
the limited accuracy of the used numerical algorithm. 

The second practical limitation is constituted by the fact that the terminals of 
real components are often coupled to other nodes of the circuit. This coupling can 
be modeled with parasitic components connected to the terminals. The parasitic 
components of the positive terminal are often different from the ones of the negative 
terminal. In addition, parasitic components are often nonlinear. 

We conclude this section by noting that if the two tones are in the pass-band of 


the filter the OIP2 is 
ial 


@3dB 


2 
=2 11.32 
| ~ (11.32) 


coAwR 


Vor2 © | : 
2 


11.4 Low-Pass Filter with Nonlinear Capacitor 227 


11.4.3 Third Order Intermodulation 


In this section we investigate the situation where there are two interfering signals, 
one at œw; and a second one close to twice this frequency œw = 2w; — Aw so that the 
lower side-band IM3 falls in the pass-band of the filter 


2w — @ = Aw < 34B @1,@2 > @zag > Q. 


To characterize this situation we compute the IP3. 
The IM3 of interest is obtained from the third order transfer function 


F3(S1, 82, 83) = (81 + s2 + 83) Ay (s1 + s2 + 83) 


2 
l (5 [H (81) Hy (s2) Ay (83) (82 + 83) H1 (82 + $3) ley 


+ SH (51) Hı (s2) M ws}. 


evaluated at the frequency mix m = (1,0, 2,0). Setting sı = y@ 1, s2 = Jæ; and 
53 = — Jœ, the term enclosed in the symmetrization operator becomes 


Ay (Jæ) Hı (yo) A (—j@2) 


1 
(2) Qo) M201) +4) Coi + Aw) Hi (o + Ao))]. 


If we assume |w, — Aw| > w3qp and use the approximation 


—R -1 


Jæcı R = C1 


JOH (jo) © jo 


we can simplify it to 


-1 
Hı (jo) Ai (gai) Hı (Ons ; 
1 
Using these results we obtain 


H3 (Jwi, J@1, —J@2) 
2 


C C 
© Aol GAO) Hi (e1) Hi(yor) Hi(—Jon)|-5> + S| 
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The IIP3 and OIP3 are obtained by inserting this result in (11.11) 


4 (a@\c\)3 
In © 3 : : z (11.33) 
\ Aof? — real 
TETE aes | i (11.34) 
om3 ™ C3 1 (c\2 ` 
\ 3 Aw oag | £ — a2) 


These expressions reveal that the IP3 depends not only on the third order coefficient 
c3, but also from the second order one c2. The reason is the fact that second order 
intermodulation products are fed back to the input of the nonlinear component, where, 
in combination with the fundamental tones, they pass again through the second order 
nonlinearity. This is the effect that was discussed in Sect. 10.2 with the help of the 
signal-flow graph of Fig. 10.4 and the reason for c) being squared. The expression 
for the OIP3 highlights the fact that it is the ratio of the coefficients cz and c3 to the 
linear capacitance c; that matters. The expressions also reveal that the IM3 generated 
by second order and third order nonlinearities have either the same or opposite phase 


and that, if 5 
C3 J 
2= 42%), 11.35 

C1 ate ( ) 


the two cancel each other. 

In the previous section we discussed the fact that using equal nonlinear capacitors 
connected back-to-back eliminates even order components from the response of the 
system. This is not the case for odd order nonlinearities. To see this, we can draw 
the third order equivalent network of the filter with back-to-back capacitors. The 
equivalent sources representing the third order nonlinearities of c, and c, are 


a E 
I.,3(81, 82,53) = ao + s2 + s3) Vo,1 (51) Vo,1 (82) Vo,1 (53) 


and 
he Cn 3 
I, 3(815 82, 83) = ae + 82 + 83) Vo,1 (81) Vo,1 (82) Vo,1 (53) 


respectively, where we have considered that V,2(s2, 53) is zero. For equal capacitors 
Cp,3 = Cn,3 = €3/2, therefore, having the sources opposite reference directions, they 
combine to form a single source equivalent to the one of a single nonlinear capacitor 
with cp = 0. 

As an example, we consider again a filter with a cur-off frequency of 5 MHz 
implemented with a nonlinear MOS capacitor having the characteristic shown in 
Fig. 11.15a and biased at 0 V. At this bias point the ratios c2/cı and c3/c; are 1.73 
and —0.94 respectively. If the filter is driven by a tone at 15 MHz and a second one 
at 27.5 MHz (11.34) predicts an OIP3 of 
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OPI3 ~ 16.0 dBV. 


Note that in this example it is the second order nonlinearity that dominates the IM3 
as 


1 2 
~ 0.16 < (2) x 0.75. 
4 


Thus, using back-to-back capacitors improves the OIP3 up to 
OIP3g28 © 23.6 dBV. 


For comparison, we simulated the filter with the full nonlinear capacitor charac- 
teristic of Fig. 11.15a. The obtained IM3 as a function of the input tones magnitude 
is shown in Fig. 11.21. The figure also shows the IM3 obtained using back-to-back 
capacitors. In both cases the obtained IP3 is in good agreement with the above cal- 
culations. The IM3 starts to depart from a straight line with a slope of three at a 
level of the input tones of ca. —67 dBA. This corresponds to an output fundamental 
magnitude of ca. 0.2 V for the tone at w; and is close to the level at which the poly- 
nomial approximation starts to deviate significantly from the real characteristic of 
the capacitor. 

Further, we verified the occurrence of canceling between the IM3 produced by 
the third order nonlinearity with the one produced by second order. Figure 11.22 
shows the magnitude of the IM3 as a function of the bias voltage of the capacitor. 
The curve shows a clear notch at a bias voltage of ca. —0.19 V, the bias voltage at 
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Fig. 11.21 Simulated IP3 of the LPF with the capacitor having the characteristic shown in 
Fig. 11.15a and with two equal back-to-back (B2B) capacitors. The two input tones were of equal 
magnitude at 15 and 27.5 MHz respectively 
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Fig. 11.22 Simulated IM3 of the LPF with the capacitor having the characteristic shown in 
Fig. 11.15a as a function of the capacitor bias voltage Vo. The filter was driven by two equal 
tones of magnitude 0.1 at 15 and 27.5 MHz respectively 


which the coefficient ratios c2/c; and c3/c, satisfy the canceling condition expressed 
by (11.35). This notch disappears at large signal levels, where contributions to the 
IM3 from higher order nonlinearities become important. The curve also suggest that, 
to obtain the best linearity, one should use a large bias voltage bringing the MOS 
capacitor in strong inversion, where its capacitance becomes almost constant. 

Before concluding this section we investigate the case in which the two tones are 
in the pass-band of the filter. In this case the term in the symmetrization operator in 
F3(J@1, J@1, —J@2) is 


Ay (71) A, (7a) Ay (—j 2) 


; [21 (201) Hi (J201) + 4J (—01 + Aw)(—R)] 
= M (jo) Ai (Joi) Hi (Jo): [oi Fhy201) + (œ — Ao)R]. 
If we further assume that 2@, also falls in the pass-band of the filter it simplifies to 
-H (71) A (Jo) Ai (Jo): J Aor. 


The third order nonlinear transfer function evaluated at the frequency mix m = 
(1, 0, 2, 0) is therefore 


11.4 Low-Pass Filter with Nonlinear Capacitor 231 


50 
OIP3 (S/B2B) = 9.74 / 10.90 dBV 
a= 
0 
S 
m 
2 -50 
o 
2 
-100 IM3 ext. 
IM3 B2B ext. 
Fund. ext. 
I -aa 
IM3 B2B ——— 
Fund. ——— 
-150 
-100 -90 -80 -70 -60 -50 -40 
|I| (GBA) 


Fig. 11.23 Simulated IP3 of the LPF with the capacitor having the characteristic shown in 
Fig. 11.15a and with two equal back-to-back (B2B) capacitors. The two input tones were of equal 
magnitude at 1 and 1.1 MHz respectively 
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The results of a simulation with one tone at 1 MHz and the second one at 1.1 MHz is 
shown in Fig. 11.23. The results are again in good agreement with the OIP3 estimated 
with the help of the above equation which gives 10.1 and 10.7dBV for a single 
capacitor and for back-to-back capacitors respectively. 


11.4.4 Large Signal Effects 


In this section we evaluate gain compression and amplitude-modulation to phase- 
modulation due to the nonlinear capacitor. The onset of both of these effects is 
governed by the third order transfer function evaluated at the frequency mix m = 
(0, 1, 2, 0) relative to the linear transfer function at the fundamental 


232 11 Weakly Nonlinear Time Invariant Circuits 


A3(J@1, J@1,—J1) 
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@zag 3 L\c1/ @3aB ci 
where we have assumed 2w; < w3ap. The phase of this expression determines the 
presence of gain compression or expansion and AM2PM. 

As a concrete example, we consider again a low-pass filter with a cut-off fre- 
quency of 5MHz, R = 1 kQ, the nonlinear capacitor with the characteristic shown 
in Fig. 11.15a and driven by a sinusoidal tone at 1 MHz. In this case the term in 
the square bracket in the above expression, multiplied by minus one, evaluates to 
—0.6 + 70.94. As the real part is negative we expect some gain compression. How- 
ever, the imaginary part has a larger magnitude which implies that AM2PM should 
be somewhat more pronounced. If we use (11.7) to estimate the amplitude of the 
input tone producing a phase change of 1° we obtain a value of -67.3dBA which 
corresponds to an output swing of 0.61 mV. A look at Fig. 11.15a shows that at these 
levels a second order approximation of the capacitor characteristic is a very poor 
approximation of the real characteristic. For this reason we can’t expect this estimate 
to be accurate. 

A believable prediction can be made for levels where the approximation is good. 
For example, a phase change of 0.1° is predicted to happen at an input signal level 
of -77.3 dBA which corresponds to an output swing of 0.193 mV. Similarly, (11.5) 
predicts a 10mdB gain compression at an input level of -77.2 dBA. These levels 
compare quite favorably with the values obtained by a numerical simulation and 
shown in Fig. 11.24. The simulation shows that these effects remain very small up to 
the large output swing of 1 V RMS which is close to the reliability limit supported 
by these devices. 


11.5 Class-AC Common-Source Stage 


In this section we analyse the common-source stage shown in Fig. 11.25a for use as 
an RF amplifier. In particular, we are interested in the distortion introduced by the 
nonlinear i — v characteristic of the transistor and in the influence on distortion of 
the choice of gate bias voltage Vg. For simplicity in this section we neglect the Cga 
capacitance. We will consider circuits with some form of local feedback in a later 
section. 

The following is a simple large-signal MOSFET model presented in many text- 
books [27, 28] 


0 ves — Vr <0 
ip = į K'¥ (ves — Vr — B8)ups(1 + Avps) 0 < vps < (ves — Vr) 
£ (wgs — Vr’ (1 + Avps) 0 < (vgs — Vr) < vps. 
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Fig. 11.24 Simulated AM2PM and gain compression of the LPF with a nonlinear capacitor having 
the characteristic shown in Fig. 11.15a and driven by a tone at 1 MHz 
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Fig. 11.25 a Common-source amplifier AC schematic b Common-source amplifier small-signal 
model 


The second equation describes the so-called linear region of the characteristic. This 
is the region where the overdrive voltage vgs — Vr is sufficiently large to cause a 
conductive surface charge channel in the active area at the surface between source 
and drain of the transistor and vps is sufficiently small that the channel extends 
all along from the source to the drain terminal of the transistor. In this region the 
transistor behaves essentially as a nonlinear resistor. 

The third equation describes the saturation region of the characteristic and is the 
one of interest for implementing amplifiers and most other analogue circuits. In this 
region vgs — Vr is sufficiently large to cause the formation of a conductive channel. 
However, vps is larger than the saturation voltage which means that the channel 
is present close to the source side of the transistor, but doesn’t extend all along 
to the drain terminal. In this region the current through the transistor ip is almost 
independent of the drain voltage and the transistor behaves to a good approximation 
as a voltage-controlled current source with vgs the control voltage. The parameter 
à takes into account the fact that the length of the channel does depend on the drain 
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Fig. 11.26 a FinFET input side characteristic. L = 22 nm, nfin 
Nfinn ¢m71 nmb FinFET output side characteristic. L = 22 nm, ngin 
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voltage and makes ip a weak function of the drain voltage [28]. In this simple model 
the saturation voltage is equal to the overdrive voltage vgs — Vr. 

The characteristic of real transistors depends on many effects not captured by 
this simple model. To enable the design of analogue circuits, very accurate transistor 
models have been developed and made available in circuit simulators. Unfortunately, 
most of those models depend on several dozens to hundreds of parameters making 
them unsuitable for analytical estimates. Figure | 1.26a shows the characteristic of a 
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FinFET with a channel length L = 22nm as given by the CMG-BSIM model [29] 
with parameters from [30]. Figure 11.26a shows ./7p as a function of vg with the 
source connected to ground and vp at a fix potential of 0.4 V. It shows that between 
0.35 and 0.65 V the deviation of the characteristic from a straight line as predicted 
by the above simple model is quite small. Figure 11.26b shows ip as a function of 
up for a fix gate voltage of 0.5 V. Here as well, the simple model gives a fairly good 
approximation over an extended range of the characteristic. The pictures show the 
values of K’, Vr and A obtained by fitting the model to the curves. 

Using the simple model the current ip can be split in two parts 


ip =ipatipo 


with 


/ 


ipa = a gss —Vry? 


iD b = 8o(ip,a)Ups = ip,advps . 


The current ip a can be interpreted as the output of an ideal voltage-controlled current- 
source, while the current ip , can be interpreted as the current due to a nonlinear load 
resistance. Since an ideal current source is not affected by its load, from an analysis 
point of view, it is convenient to analyse the two parts separately and combine the 
effects with the results of Sect. 10.1. For this reason we lump the components to the 
right of line A in Fig. 11.25b into a nonlinear load. In this section we focus on ip a. A 
common nonlinear load will be considered in the next section. Similarly, for analysis 
purposes, the nonlinear C,,, capacitance can be considered part of the driving circuit. 
In the case of a resistive source we can reuse the results of the previous section 
with minor modifications. Often however, the distortion introduced by Cga is small 
compared to the one introduced by the i-v characteristic. In the following we will 
simply write ip for ipa. 

While the above model can be used to obtain a relatively good approximation 
of the transconductance g,, of the transistor, it doesn’t provide a good estimate of 
higher order distortion terms. Therefore, to analyse distortion we approximate the 
transistor characteristic around the operating point by a third order polynomial 


. 2 3 
ld = 8mUgs F §2U 55 F E3 Vgs 


and extract the coefficients from simulation. Figure 11.27 compares first, second and 
third order polynomial approximations to the full characteristic at a bias level of Vg 
=0.5V and Vp = 0.4V. At this bias level a third order approximation provides a 
good approximation up to a signal level of about 150 mV. Figure 11.28 shows the 
three coefficients gm, g2 and g3 as a function of the gate bias voltage Vg simulated 
using CMG-BSIM models. While the simple model predicts a vanishing third order 
coefficient g3 the picture shows that it disappears only at a single gate bias point. 
For small gate bias voltages the g3 coefficient is positive, while for large values it’s 


236 11 Weakly Nonlinear Time Invariant Circuits 


0.012 — 
iD 
1, ord. approx. 
01 b 2 ņ ord. approx. — — — 
00 3 ord. approx. 
0.008 | 
0.006 | 
T 
A 
0.004 H 
0.002 + 
0 Se 
-0.002 l ! 
0.3 0.4 0.5 


Fig. 11.27 Polynomial approximations of the transistor characteristic around Vg = 0.5 V. Vp = 
0.4 V, same transistor size as in Fig. 11.26a 
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Fig. 11.28 First three coefficients of a polynomial approximation of the transistor characteristic as 
a function of the gate bias point. Vp = 0.4 V, same transistor size as in Fig. 11.26a 
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Fig. 11.29 Second and third order coefficients of a polynomial approximation of the Class-AC 
stage characteristic as a function of the gate bias deviation from the nominal gate bias point A VG 
for Vp = 0.4 V 


negative. We may try to minimize third order distortion by biasing the transistor at 
the bias point at which g3 is zero. However, this strategy doesn’t lead to a robust 
design. In fact mismatch between the transistor and the bias devices introduces a 
statistical Gaussian bias error with a typical standard deviation of order [31] 


— >mV- pm 


í VWL 


where L and W are the length respectively the width of the active channel. A more 
fruitful approach is to use two transistors connected in parallel, but biased at different 
bias levels. One biased at the minimum of g3 and the second at its maximum. The 
relative size of the two transistors is chosen in such a way as to make the sum of 
the g3s cancel. In this way the deviation of the bias point of each transistor due to 
mismatch has a smaller impact on the value of g3. The resulting effective g3 of the 
transistor couple, a so called Class-AC stage, is shown in Fig. 11.29. 

The IIP3 of the stage can be estimated from (11.11). For a single transistor biased 
at Vg = 0.46 V we read from Fig. 11.28 gm ~ 15mS, g3 ~ —110 mA/V°? giving an 
IIP3 of ca. -10.4dBV. From Fig. 11.29 we see that a Class-AC stages reduces g3 by 
ca. a factor of 10, while leaving g,, almost unchanged. From this data we estimate that 
the IIP3 should be ca. 10 dB higher or approximately —0.4 dB V. The results obtained 
by numerical simulation with the full transistor models are shown in Fig. 11.30. 
To suppress the effect of the nonlinear output conductance g, the drain was held 
at 0.4V using an ideal voltage source. The circuit was driven by a voltage source 
with a resistance of 50 Q generating two tones of equal amplitude at fı = 1.01 GHz 
and f2 = 1.02 GHz. Note that the simulation does include the effect of a slightly 
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Fig. 11.30 Simulated IM3 of a Class-AC stage compared to the one of a simple common-source 
stage consisting of the Class-A device only. Class-A device: L = 22nm, nfin = 10,n ¢ = 16,m = 1 
biased at Vg = 0.453 V. Class-C device: L = 22nm, nfin = 10, nf = 8, m = 1 biased at Vg = 
0.342 V. Vp = 0.4 V. |v;| is the magnitude of each of the two input tones 


nonlinear Cz, as well as the one of Cga. The results are in good agreement with our 
estimates up to a level of about -25 dBV (~ 80 mV) per tone that translates in a peak 
input voltage of 160 mV. This is in line with expectation as beyond this level the third 
order approximation of the characteristic starts to break down as noted earlier. 

The Class-AC stage reduces g3, but doesn’t reduce g2. Therefore, if the second 
order transfer function of the preceding or following stage is also large, then the 
combined system will still produce third order distortion. If we call the first subsystem 
G and the second one H the combined third order impulse response is in fact (see 
Table 10.1) 

(ho g)3 = hy * g3 +2h2 * [81 @ 82] ym + h3 * ge 


which doesn’t disappear even if g3 and h3 are both zero. One approach to reduce g2 
(on top of g3) is to use a complementary structure comprised of an nMOS Class-AC 
stage and a pMOS one as sketched in Fig. 11.31. Here we use common-gate stages 
(see the next section) to reduce the effects of Cga and combine the currents through 
a transformer. For good results one needs large coupling between the primary and 
secondary of the transformer. In a monolithic implementation this is best achieved 
using equal coils stacked one on top of the other. We can also directly connect the 
drains of the two stages. In this case the bias currents of the two stages must coincide 
and a mean of controlling the DC drain voltage is necessary. 
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Fig. 11.31 Complementary Class-AC stage suitable for RF applications 


11.6 Common-Gate Stage 


In this section we investigate the linearity properties of the common-gate stage shown 
in Fig. 11.32a. We first consider the case in which the stage is driven by a source with 
internal resistance R, and then specialise to the case in which the stage is used to form 
a Cascode. A basic variant of the Cascode stage suitable for use at RF frequencies 
is the combination of a common-source stage followed by a common-gate one. The 
combination of the two stages behaves as an improved common-source stage with 
much reduced C,q and output conductance g, [27]. In this section we will show that, 
under suitable conditions that we will work out, the addition of a common-gate stage 
does not degrade distortion either. Due to these very desirable benefits the Cascode 
stage is a widely used configuration. 
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Fig. 11.32 a Common-gate stage AC schematic b Common-gate stage Small-signal model 
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Consider the small-signal model shown in Fig. 11.32b. The input voltage v; corre- 
sponds to the source voltage. The input current is the current entering into the source 
terminal. The part of the input current that doesn’t flow through C,, is labeled i, and 
represents the current that flows through the transistor active channel to the drain. 
The current leaving the drain terminal must therefore have the same value. This is 
represented by the output side current-controlled current source with unit gain. For 
simplicity, we neglect the distortion introduced by the nonlinear capacitance C,, as 
well as the one introduced by the drain capacitance that in the figure was lumped 
together with the load Zz. As before we characterise the linearity of the circuit by 
calculating the nonlinear terms present in the output current ic. 


11.6.1 Nonlinear Transfer Functions 


According to the model presented in Sec. 11.5 (with A = 0) the static characteristic 
of the transistor in saturation is given by 

. 2 

Ip = =v 


with vop = vgs — Vr the overdrive voltage and 8 = K’W/L. In the present situa- 
tion it is more convenient to express the input voltage as a function of the current. 
This is easily achieved by inverting the equation. If we further separate the DC bias 
terms from the small signal quantities we obtain 


r 2p +ia) y 
=a m YOD 
5 B 
which we approximate by a third order Taylor polynomial around the operating point 
lia lyi? 1 ig 3 
oy (3) +o) | 
ve oo] 5 In 8\Tp) * 16\7p 


Using the relations v; = —vg, and i, = —ig we obtain that the input characteristic 
corresponds to the one of a nonlinear resistor 


vi = Nie + ni + rig (11.37) 


with 
1 = Vop Von Vop 


i ee pga a 11.38 
7 25I” PTER i 


Bm 2Ip i 


Note that while the original expression giving ip as a function vop doesn’t include 
any third order term, the inverted expression gives a well-defined term of third order. 
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As a result, the latter is less sensitive to modeling inaccuracies than the former. We 
will therefore use the above values as estimates forr;,i = 1,..., 3. 

Using the above third order polynomial to model the source-gate nonlinear char- 
acteristic we obtain the equivalent circuit shown in Fig. 11.33 with V) and V3 the 
second resp. third order equivalent nonlinear source as given in Table 11.11. 

The first order transfer function is calculated by discarding the contribution of all 
sources of order different from one. This amounts to calculating the contribution due 
to the input source and using a Dirac impulse as input signal. Working in the Laplace 
domain, the Kirchhoff’s voltage law gives 


Rs Ue + SCsgrile1) + rites = 1. 


Solving for the first order component of J, we find 


1 1 1 
Rs +r, + sCyeri Rs = Rs +r Lees 


La(s) = Ay(s) = 


with wo = (Rs + r1)/(Csgrı Rs). 

With Z<, ı and referring to Table 11.11 we can now compute the equivalent source 
of second order Vz = r21e,1(s1)Le,1 (s2). The second order transfer function is the 
response to this source which is easily calculated to be 


1+ (sı + $2) Csg Rs 
R; Trt (sı + S2)Csgr] R; 


T..2(81, 82) = H(s1, s2) = rale 1 (S1)Le,1 (82) 


or, expressed in terms of Hy 
Ay (81, 82) = —ro[1 + (s1 + 52) Cog Rs] Ay (s1 + 52) Ay (81) Ay (s2) . 
With J... we can compute the equivalent source of third order 


V3 = 2ro [Te,1(s1)Le,2(s2, $3) Jey + r3Le,1(S1) Lc,1 (82) Le,1 (83) - 
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The third order transfer function is the response to V3 which is calculated in a similar 
way as Hp 


3 (81, 82, 53) = —[1 + (81 + 82 +53) Cs Rs] Hi (51 + 82 + 53) V3 (51, $2, 83) 


11.6.2 Cascode 


We now specialise to the case of a Cascode. Since the transfer functions are found 
by analysing a sequence of linear networks, we can use the Thévenin-Norton theo- 
rem [32] to transform the source into the parallel connection of an ideal current source 
and the internal resistor R, as shown in Fig. 11.34. The resistor R, corresponds now 
to the reciprocal of the output conductance g, of the driving common-source stage. 
The latter is usually much larger than r4, so it has little effect on the operation of the 
circuit. For this reason and to obtain easier to interpret expressions we calculate the 
transfer functions in the limit as R, tends to infinity. Under this assumption and using 
the results of the previous section, the first, second and third order transfer functions 
from the ideal source Z, to the output current I, are 


Hals) := im. Hı (s) R; = ; (11.39) 


f= 


Heo(s1, 82) := lim Ap(s1, 2) Re 
Rs—>00 (11.40) 
= =r (s1 + S2)Csg Hei (81) Hei (82) Hei (S1 + 52) 
and 
Ha (s1, s2, 83) = lim H3(s1, s2, 53) R? 
R;—> œ 
= (s1 + s2 + 33)Cse [273 [Hai (s1 + 52)(51 + 3)Csslyn — 73] 
» Aes (81) Hei (82) Hes (83) Hei (s1 + 52 + 53) 


(11.41) 
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respectively, where now wo = 1/(Csgr1). Note that the symmetrization in H;3 is 
intended over all three Laplace variables s1, s2 and s3 


1 
[Her (st + 52)(S1 + 82) Cog] = 3 | Het Si + 52)(51 + 52)Cog 
+ Hals +53) (81 + 53) Csg 
+ Hei(s3 + 52)(53 + s2)Cse } . 
Consider now the classic two-tones third order intermodulation test with one tone 
at w and the second one at œw = wı + Aw. In particular consider the IM3 tone 


characterised by m = (1, 0, 2,0). Assuming |Aw| < |@ | the above symmetrised 
expression can be approximated by 


2 JaiCsg 
He1 (51 + $2)(81 + S2)Cs a 
[ # Jeym 31+ J 
and, with it, the third order transfer function by 


He3(J@1, JO1, —J@2) 


4 
~ yoo Csg | 315101 Cog Her Cjo) = r3 | Her (Jor) Het (Jer) Het (—J 01) Het (Jo) 


If @; < wo/5 the value of | H.;(j@1)| can be approximated by 1 with an error of less 
than 2% and the magnitude of | H.3| becomes very nearly 


4 
372) 1 Cog Hei (2J01) = r| : 


@) Csg 
Using (11.38) for the coefficients of the nonlinear characteristic of the transistor we 
thus obtain 


1 w1Csg 
81 Em 


2 J@C 
3 8m 


|Hea(Joor, Jor =o) © Hajo. (11.42) 


The magnitude of the IM3 tone normalised to the DC current Ip is therefore 


ss 3 w1 Csg 
32 gm 


2 JaiCc 
3 8m 


Ie3,m 
Ip 


3 


£ Haja) — 1| 


I; 
Ip 
From this expression we can read several interesting aspects. First, both the second 
and the third order nonlinearities of the transistor characteristic contribute to the IM3 
tone. This is visible from the appearance of r3 as well as r2 in the expression for 


H3. The contribution to an intermodulation product of third order by second-order 
nonlinearities is due to the presence of (local) feedback. This can be appreciated 


244 11 Weakly Nonlinear Time Invariant Circuits 


graphically by looking at Fig. 11.34. The second order source Vz creates a current 
that circulates again through the input of the circuit. Therefore, the generated second 
order tones pass again through the second order distortion where they can mix with 
the input tones to produce frequency mixes of third order. 

The contribution to the IM3 tone from second-order distortion is approximately 
orthogonal to the one from third order distortion. Therefore, it’s not possible to size 
the transistor in such a way as to make the two cancel each other, not even at a specific 
frequency. 

The IM3 is largely dominated by r3 up to very high frequencies and for w; up to 
ca. wo /10 it is proportional to w;. The quantity gm /Csg corresponds (neglecting Cga) 
to the angular frequency at which a common-source stage has unity current gain. It 
is called transit frequency and denoted by 


w Em 
T= . 
Cog 


(11.43) 


It is one of the key parameters used to characterise the high-frequency capabili- 
ties of transistors. With it the magnitude of the IM3 up to ca. w < œwr/10 can be 


approximated by 
3 


lam) w 3 aj 
Ip 32 wr Ip 


This shows that for low distortion one needs fast transistors. Looking again at 
Fig. 11.34 we can appreciate that in the limit as w;/wz tends to zero (which means 
that C,, tends to zero) the nonlinear sources become floating and can’t generate any 
frequency mix current (remember that we also assume R, — ov). 

In general, distortion introduced by the input (common-source) stage of the Cas- 
code configuration generates frequency mixes of second-order. These can mix with 
the fundamental tones in the second-order distortion of the output (common-gate) 
stage to produce other IM3 components. However, since | H.2(j2@1, —j@2)| is also 
proportional to wı/ær this does not substantially change the situation. 

For simplicity in our discussion we assumed R, — oo. From the gained insight 
we can appreciate that at low frequencies it is a finite value of Rẹ which will limit 
IM3 and, the lower R,, the higher the IM3. In general however, the common-gate 
stage of a Cascode is not the stage limiting low frequency linearity. 


11.7 Degenerated Common-Source Stage 


In this section we investigate the effect of local feedback on distortion and show 
that introduction of feedback may lead to degraded linearity. As a concrete example 
we analyse the degenerated common-source amplifier depicted in Fig. 11.35a. The 
impedance Z, is called the degeneration impedance. Its presence reduces the gate- 
source voltage across the transistor by an amount proportional to the output current. In 
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Fig. 11.35 a Degenerated common-source stage AC schematic b Degenerated common-source 
stage small-signal model 
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other words it introduces feedback around the transistor. The impedance Z, represents 
a generic driving impedance. 

One way to analyse the circuit is to model the transistor as a nonlinear voltage- 
controlled current-source characterised by a third (or higher) order polynomial 


. 2 3 
ld = 8mUgs F E2V gs + E3 Ugs 


and solve Kirchhoff’s equations for vgs. Having found the voltage components 
Vgs,1, +++, Ugs,k Up to some order of interest k, one then finds the output current 
components io,1, . - . , o k by use of the polynomial approximating the transistor char- 
acteristic. Instead of using this method we show how the use of a nullor allows the 
problem to be solved in a more direct way, by permitting to directly obtain an equation 
for the output current i,. 

Nullators and Norators are pathological network elements. A Nullator is a two 
terminal element represented by the symbol shown in Fig. 1 1.36a and characterised 
by the two equations 


A Norator is a two terminal element represented by the symbol shown in Fig. 11.36b 
whose current and voltage are arbitrary and completely determined by the surround- 
ing network. In other words it is characterised by zero equations. For a linear network 
to have a well-defined solution a Nullator must therefore always appear alongside a 
Norator. Such a pair is called a Nullor and can be used to model several elements 
such as controlled sources, OpAmps and transistors. In particular, we can use it to 
represent the inverted series (see Sect. 11.6.2) 


Ugs = ryig + rig =r: r3iq 
of the transistor characteristic. A nullor based small-signal model of the degenerated 


common-soutce stage using this transistor characteristic representation is shown in 
Fig. 11.37. Note that the transistor characteristic is represented by a nonlinear resistor. 
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Fig. 11.36 a Symbol of the Nullator reminding the shape of the zero digit 0 b Symbol of a Norator 
reminding the infinity symbol oo 


11.7.1 Nonlinear Transfer Functions 


From the model in Fig. 11.37 and Kirchhoff’s laws we obtain the following system 
of equations relating the output current J, to the input signal V, 


1 
V; = (Zs a ys + Zed F Io) 
SCgs 


v. = 1 
T SCi 


Vgs = rilo trol? +731? . 


I; 


After eliminating Vgs and J; we obtain the single equation 


V, = [r + Ze + (Zs T Ze)rı Cess lo 


(11.44) 
+ [1 + (Zs + Ze)Cgss | (r212 +1312). 

The first order transfer function is obtained by applying a Dirac impulse as input 
and discarding all terms of order higher than one in the equation. This is equivalent 
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to removing the nonlinear sources from the equivalent circuit. Using the relation 
rı = 1/gm we obtain 


Em _ Em 
L(s) 1+ gnZe + SCgs(Ze + Zs) 


HGy= (11.45) 


To compute the second order nonlinear transfer function we first insert the first 
order solution into the nonlinear terms and retain only second order ones. Alterna- 
tively we can use Fig. 11.11 to read the value of the second order nonlinear source for 
a nonlinear resistor. In both cases, after adjusting the representation of the differential 
operator by replacing the variable s by sı + s2, we obtain 


0= [r + Ze + (Z; + Ze) Cos (Sy + s2)|M(s1, s2) 
a [1 F (Zs F Ze)Cgs (S1 T s2) |21 (sı) Mı (s2) : 


Note that for brevity we didn’t explicitly write the argument of impedances. Their 
value has of course to be evaluated at sı + s2. The second order nonlinear transfer 
function is thus 


Hp(s1, 82) = —r2 Hy (s1) Hı (s2) Hi (s1 + s2)[1 + (Zs + Ze)Cos (81 + s2)]. (11.46) 


To find the third order nonlinear transfer function we proceed in a similar way 
and obtain 


H13(s1, 82, 83) = — {2r2 [H1 (81) H2(82, 83) leym + r3 H1 (s1) Mi (2) Mi (s3) f 
Hy (sı + 52 + 83)[1 + (Zs + Ze)Cgs (s1 + 52 + 53)]. (11.47) 


11.7.2 Resistive Degeneration 


We now specialise to the case of resistive degeneration Ze = R, and aresistive driving 
impedance Z, = R, and calculate the intermodulation products of third order when 
driven by two tones of equal amplitudes at frequency w; and w; + Aw respectively. 
As usual we assume Aw X w]. 

As a first step, to calculate H3 for the mix (1, 0, 2, 0) we evaluate 


[Hi (Jo) HMJ, — J (@ + A@)) sym 


Bin = xeon] 
3 LGgai2L(—jo) | L(-jAa) | LQ) 


Dd 


with 
N(s) := 14+ (Zs + Ze)SCgs « 
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Inserting this expression into H3 we obtain 


A3(J@ 1, J@1, —] (@ + Ao)) 


gi Nai) $ 3 | N(-j Ao) reo | 
~ 158m 2 r 
L(ja1)3L(— Ja) |3 L(-jAo) — L(2a1) 


Since in Sect. 11.5 we characterised the transistor in terms of gm, g2 and g3, we 
express the coefficients r2 and r3 in terms of them using the results of Sect. 11.2.1 


_ 82 


293 — 8m83 
n> 3 = —— z. 
Em 


n= 
Sin 
Substituting these expressions leads finally to 


A3(Ja1, J@1, —J (@1 + Ao)) 


~ N (Jæ) f Fee 1 N(2%1) |+ | 
LGPL Jo) \ gn L3 LEJAo) | 3 Lo) srj 


(11.48) 


We can now discuss the effect of a small amount of feedback introduced by a small 
resistor R, on linearity. First note that, as expected, for Ze = 0 Q the term in square 
brackets vanishes making the IM3 depend only on g3. As Re is increased the contri- 
bution of go increases and at low to moderate frequencies there is some possibility 
of cancelling between the contribution due to g3 and g2. As Re increases beyond 
this value, the second order contribution starts to dominate. At high frequencies only 
imperfect cancelling is possible due to shift in phase of the gz contribution. 

Figure 11.38b shows the low to moderate frequency IIP3 of the Class-AC stage 
from Sect. 11.5. It shows that cancelling occurs for very small amounts of feedback 
and, as is typical for cancelling effects, the performance is very sensitive to small 
variations in component values. Due to the large value of go, a small to moderate 
amount of feedback with g, Re in the range of 0.03—2.5 leads to an actual degradation 
in IIP3. Note that small values of Z, may be introduced unintentionally by parasitic 
effects due to the interconnections between components. 

A linearity improvement can be obtained by using a large amount of feedback 
8m Re > 1. To simplify calculations let’s assume lac es(Rs + R.)| < 1, then 


1 & 
A3(jo1, Jwi, — J (@1 + Aw)) © | 222 + g3 
L(JæPL(—jJo) Em 


and r 
Hi (Jo) © R 
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Fig. 11.38 a Weak feedback (a) 10 
region of Fig. 11.38b b 
Moderate-frequency 

resistively degenerated 5 
common-source 
transconductance stage IIP3. 
The solid curves are 
calculated using 

(11.45), (11.47); the dashed 
line with (11.49). The 
transistor parameters are 5 
approximate values taken 

from Figs. 11.28 and 11.29: 

8m = 17mS, g2 = 70mS/V, 

g3 = 10mS/V2, R, = 50 Q -10 


Using (11.11) to calculate the IP3 shows that under these conditions the latter does 
in fact increase with increasing Re 


A(gmRe)? 


2 
§3 82 
3/2 -2(%) 
The reason for the improvement is a substantially reduced amplitude of the voltage 
Vs controlling the nonlinear sources compared to the circuit input signal V,. Linearity 
thus comes at the expenses of a much reduced signal transconductance which for RF 
circuits is often not acceptable. 


IIP3 ~ ; &mRe > 1. (11.49) 
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11.7.3 Inductive Degeneration 


A second type of degeneration widely used ar RF frequencies is the inductive one. 
This type of degeneration is often used in the input stage of low-noise RF amplifiers 
(LNAs), a basic small-signal model of which is shown in Fig. 11.39. 

An important characteristic of RF amplifiers is the input impedance Z;. In many 
situations it is required to be real and equal to the source impedance R,, or some 
standard value. From our model a simple calculation shows that Z; is given by 


L 1 
Zi = Ri + JXi = Bm —— +s(L; + Le) + . 
Cos SCgs 


A degeneration inductor Le thus allows a real part to be introduced to the input 
impedance without using resistors. Avoiding resistors at the input of LNAs is neces- 
sary to avoid limiting the achievable sensitivity. The reactive part of the impedance 
can be cancelled over some frequency band by resonating it, in our example using 
the inductor L,. The input network thus consists of a series resonator tuned at the 
center frequency of the band of interest. 

In this section we analyse the linearity characteristics of this stage and in particular 
its IP3. The nonlinear transfer functions are readily obtained from our previous results 
by setting Ze = s Le and Z; = R; + sL;. Doing so, the first order transfer function 


becomes 
8m 


1+ S(8mLe + Cgs Rs) +F gs Ge (Le F Ls) 


Ay (s) = 


Fig. 11.39 Small-signal 
model of an inductively 
degenerated common-source 


1 
y 
stage R, L, 6 


JO | PO 


Uj 
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Note that g&mLe = Cgs Ri. We can therefore write the denominator in the standard 
form 


Hy (s) = — 4 — (11.50) 
Lp oot) 
with 

1 1 
= Call t Ly) 1i = RoC 

gs\Le s iW0U gs 
ee 1 
et — ies — qs = >= . 
qt qi qs Rs 0C gs 


The same parameters can also be used to put N (s) in standard form 


s l s N2? 
Ns) =1+——+(=) . 
wo qs Wo 
The value of H; relevant for the two-tones IP3 test can then be obtained by substi- 
tuting these expressions in (11.48). The resonance frequency of the input resonator 
is evidently set to the frequency of the input signal wọ = @ so that 


No) =, Lye.) = me Hi (Jo) = —Jqgm. 
Ss t 
and 
eat (@ + Ao)) IG) | 83 Ppa N lg 
w1, J@|, —J(@ @))& F 
BE eee q l elo 320m) = 


With these results we can compute the ITP3 using Eq. (11.11) as before 


2 1 
mp3 ~ |E 
qt | qr 2| 2 1 No) 83 
2(#) E F 3 La) = J + A 


(11.51) 


In the common case in which the input resistance R; is equal to the source impedance 
q4s/qı = 2. The IP3 of the circuit is thus approximately inversely proportional to the 
quality factor of the input resonance. This is due to the fact that at resonance the 
magnitude of the voltage across the reactive components is roughly q; times the one 
across the resistive part. In other words, the voltage V,, controlling the nonlinear 
sources is amplified by a factor of ca. q; compared to the input signal V,. This very 
same characteristic is also the reason for the good noise characteristic of the circuit: 
the input network provides some voltage gain before the first noisy device. 
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The quality factor of the network also influences the relative contributions of g2 
and g3 to distortion through the term 


NQya1) _ —3 +92 
Lja) —3+ 52° 


For large quality factors g,, qs >> 1 the ratio approaches 1 which makes the IP3 
essentially independent of g2. For small quality factor values the contribution due to 
g2 is not negligible, especially if g3/gm is small compared to (g7/gm)* as is the case 
with Class-AC stages. 

In practical implementations the component values are affected by manufacturing 
variations. For this reason and to avoid the need for tuning, the quality factor q, is 
most often chosen to have a value smaller than 5. 


11.8 Pseudo-Differential Circuits 


The analog signal path of many RF and mixes-signal integrated circuits is differential. 
This means that the signal of interest is transmitted on two equal lines carrying the 
same signal, but with opposite polarities. The main objective is to make the system 
insensitive to noise affecting both lines equally. This can be, for example, noise due 
to the activity of digital circuits propagating through the common substrate of the 
IC. A differential circuit is one that is designed to process the difference between 
the two input terminals sensing the two lines carrying the signal and rejecting the 
common component. Formally, if v and v; are the two input voltages (relative to 
ground), the differential-mode voltage is defined as 
+ 


Ug =U; — V; 


and the common-mode voltage as 


The prototypical differential circuit is the differential-pair shown in Fig. 11.40. In the 
ideal drawn form the output currents are always i} = i; = Io/2 as long as v} = v7. 
Any common-mode signal component is thus fully rejected. 

Differential circuits do also have disadvantages. A real current source is imple- 
mented with transistors and requires a certain voltage across its terminals to work 
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Fig. 11.40 Differential pair | | 


Fig. 11.41 Pseudo- i i 
differential transconductance 


properly. This reduces the headroom left for signal processing and in modern pro- 
cesses operating at supplies voltages below 1.0 V poses severe challenges. In addi- 
tion, a current source does not only generate a DC current, but it also generates noise, 
reducing the sensitivity of the circuit to small signals. 

Pseudo-differential circuits are a class of circuits that alleviate some of these prob- 
lems while retaining some of the benefits of differential circuits. They are circuits 
composed by two equal single-ended sub-circuits each connected to one of the two 
lines carrying the differential signal. An example pseudo-differential transconduc- 
tance is shown in Fig. 11.41. 

In pseudo-differential circuits the input common-mode signal component is not 
rejected, but, if the circuit is sufficiently linear, the common-mode input appears 
as a common-mode signal at the output and remains separable from the wanted 
differential signal which appears at the output in differential form. The objective of 
this section is to quantify the conversion between common-mode and differential- 
mode in weakly nonlinear circuits. 

We first show that weakly nonlinear circuits driven by a purely differential input 
signal produce a mixture of differential- and common-mode output signals. Let’s 
denote the input signals by x* , x~. the output signals by y*, y7, the relative common- 
and differential-mode components by the same letter with index c and d respectively; 
and the nonlinear transfer function of order k of the single-ended subsystems by hg. 
By assumption the input signal is purely differential 
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The outputs of order k are therefore 


from which we conclude that for k even the output is a common-mode signal, while 
for k odd it is differential. 

Let’s now consider the response of a weakly nonlinear circuit to a mixture of 
differential- and common-mode signals 


Xd = 
xt = xe + 5 KO =H KES Ss 


2, 2 


Let’s first consider the second-order response of the two circuit halves. The positive 
and negative outputs are 


Xd 92 
yi =h, * (+7) 


1 
= hy xg? + Tha xg? + h * [xa ® Xe] 


sym 
and 
Xd we 
yy =h * Xe 
@2 1 82 
= hy * x Tarti — hz * [Xa Q Xelsym 


respectively. The second-order differential output signal component is therefore 
Ya,2 = 2h * [Xa ® Xelsym 


which includes the common-mode input signal. A similar calculation for the third 
order component gives 


@3 


X 
ya,3 = h3 * (= +3 [xa 8 A 


which also includes a term depending on the input common-mode. One can generalise 
the calculations and show that the differential- and common-mode input components 
are mixed by nonlinearities of all orders. 

Consider now the cascade of two pseudo-differential weakly nonlinear circuits 
driven by a purely differential signal. If the two subsystems are optimised inde- 
pendently to maximise IP3 without paying attention to even order distortion com- 
ponents, then, when the two subsystems are put together, one may obtain a lower 
than expected total IP3. This is because the first stage produces second (and higher 
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even) order mixes as common-mode signals which are also fed as input to the sec- 
ond subsystem. The second (and higher order) distortion components of the latter 
will then mix differential- and common-mode to produce differential output signal 
components at the IM3 frequencies. 
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Chapter 12 A) 
Linear Time-Varying Systems get 


12.1 Linear Time-Varying Systems 


In this chapter we consider linear time-varying (LTV) systems. These are systems 
whose behaviour depends on the particular moment in time at which they are used. 
The change with time may arise for example due to the sensitivity of system com- 
ponents to environmental changes. Examples of systems suffering from this type 
of sensitivity include wireless communication systems in which the communication 
channel between the transmitter- and the receiver-antennas is highly dependent on 
the environment in between and around the antennas. The variation in time may also 
be imposed intentionally by design to achieve functions that can’t be realised with 
LTI-systems. This is the case for example in communication mixers whose function 
is to shift in frequency the spectrum of a signal. 

In this section we introduce a definition of linear time-varying systems valid 
under the assumption that all signals are regular distributions. A generalisation will 
be given in Sect. 12.3. The assumption of linearity means that the superposition 
principle must hold. In addition, as for LTI-systems, we require that LT V-systems 
depend continuously on the input signal. We therefore define 


Definition 12.1 (LTV-system) A single-input, single-output, linear time-varying sys- 
tem is a system that when driven by the input signal x produces a response y that 
can be expressed by 


y(t) = AG, E) x x(t) := f h(t, E)x(t — §)dé . (12.1) 


h(t, €) is the time-varying impulse response of the system. 


The meaning of the variable £ is best illustrated by anticipating somewhat the results 
of Sect. 12.3 and apply as input signal a Dirac impulse at time to 


y(t) = h(t, E) xı ôt — to) = h(t, t — to). 
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Thus £ represents the time lapsed since the application of the input impulse. 

For causal systems the output must vanish before the input is applied. This implies 
that the impulse response must vanish for negative values of £ 


AEs, <0. (12.2) 


Therefore, the response of a causal system driven by a regular distributions x € D‘, 
is given by 


yn = | na, sx(t— sae = | h(t, t — §)x(&)dé . 


12.2 Linear Ordinary Differential Equations 


An important class of LTV-systems is the one of systems described by differential 
equations with variable coefficients of the form 


ee =e 
(5) = (gho 


with 


a d d” 4: A q”-! ee w) 
at) dem T N Gym oor 


N |t d ae ra +b uae + bo(t) 
Madey ge RS! eet : 


time-dependent differential operators. It’s easy to verify that every such system with 
n < m can be represented in a state-space representation with time dependent matri- 
ces 


7 = A(t)u + B(t)x A(.) € C(R, C”*”), BC.) € C(R, C™!) (12.3) 


y=C(tu+ Dit)x CecCc(R,C!*"), DECR,C) (12.4) 


with u the system state. Given an input signal x, the system response y is fully 
determined if one can find a state u satisfying the first equation and suitable initial 
conditions. The study of the dynamics of the system can therefore be reduced to the 
study of a system of n differential equations of first order. 
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12.2.1 Fundamental Solution 


Consider the initial value problem described by the system of n differential equations 


d 
—y= A(t 12.5 
a (t)y (12.5) 
and initial conditions 
y(0) = yo € C” (12.6) 


with A(.)ann x n matrix of complex valued functions of time a;;(.). If the functions 
forming A(.) are bounded and continuous, then the right-hand side of the equation 
is Lipschitz continuous and, as discussed in Sect. 9.1, the equation has a unique 
solution. By choosing the initial value equal to the unit vector e; € C” pointing in 
direction j, for j = 1,...,m we can thus obtain n independent solutions y; of the 
equation. The matrix formed by the column vectors yj 


Y(t) = Di0,- nO] (12.7) 


is called principal fundamental matrix of the system and satisfies the matrix equation 
d 
a = A(t)Y, YO)=T1. (12.8) 


Knowing Y, the solution of the initial value problem is thus given by 


y@t)=Y¥)y +120. 
In addition, since the columns of Y are independent at all times, det(Y (t)) Æ 0 at all 
times. The inverse of Y, Y~!, is thus well-defined as is the evolution operator (also 
called state transition matrix) 


U(t, t) := Y(t)Y '(t). (12.9) 


Note that the evolution operator satisfies 


Sag t= ($row) Y! (t) = ADY HYT! (T) = AHU (t, 1) 
d `”? dt , 


and 
U(r, tT) =1 


and is thus the principal fundamental matrix of the system at time t. From (12.9) we 
also immediately obtain 
U(t, AU, tT) = Ut, T) 
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and 
U(t, t) = [U t, D! . 


The initial value problem described by (12.5) and initial conditions y(to) = yo 
can be translated in the language of distributions by extending the functions by zero 
fort < tọ and by replacing the differential operator by the distributional one 


Dy = A(t)y + yoô (t — to) (12.10) 
as usual. Differently from the case where A(.) is constant, this equation can not be 
written as a convolution equation. For this reason and since for arbitrary distribu- 
tions multiplication is only well-defined with smooth functions, for the equation to 
be well-defined the functions q;;(.) must belong to &. This may seem like a very 
serious limitation, but remember that any distribution can be approximated to arbi- 
trary accuracy by such a function (see Sect. 3.3). In this case the fundamental (or 
elementary) solution of the equation relative to time t is defined as the solution of 
the matrix equation 

LE, = I(t —T) (12.11) 
with L the differential operator 
L:= L(t, D) := D — A(t). 
If U (t, t) is the evolution operator of the original differential equation (12.5) then 


D10- DUM, tT) = 6(t — DI +1,0- AUC, T1) 


shows that 
E (t) =1,(t —1)U(t, T) (12.12) 


is the fundamental solution relative to t of the above distributional equation and 


y(t) = En (t)yo (12.13) 


is the solution of (12.10). 


12.2.2 Formal Solution 


We now look for an explicit formal solution in D'_ (R, C”) of the equation 


Dy = A(t)y + yoô + x (12.14) 
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with A(.) a matrix of functions in & as before. As a first step we rewrite the equation 
as an integral equation. To do this we write Dy as Dé x y and convolve both sides 
of the equation with 1, to obtain 


y—14*(AMy) =yol, +14, "x. 


Thus, if x is a bounded regular distribution then the equation can be written as 


t t 


yt) — J A(T)y(T)dt = yo14(t) + / x(t)dr. (12.15) 


0 0 


Instead of solving this equation directly, we consider a more general integral equation 
and then specialise to this case. 
A Volterra integral equation of the second kind is an equation of the form 


t 


y(t) = fre t)y(t)dt + x(t), t>0 (12.16) 
0 


with x a given regular distribution in D', (R, C”), k an n x n matrix of continuous 
functions [k;j], i, j = 1,..., and y the required unknown in D‘, (R, C”). This 
equation can be solved by an algebraic method based on a group [33, 34]. 
Definition 12.2 (Group) A group is a pair (G, e) consisting of a non-empty set of 
objects G and a binary operation e, usually called the group multiplication, satisfying 
the following properties 


1. e is associative: (g; © g2) @ g3 = g] © (22 © g3). 
2. e has an identity element e: g è e = e è g = g. 
3. Every element g € G has an inverse element g~! € G: 


geg =g eg=e. 


Note that the unit element is unique, since if e’ is a second unit e = e e e’ = e’ shows 
that it must be equal to the first one. A group G acts (from the left) on a non-empty 
set X if there is a function 


GxX>X, (e, x)= g-x 


such that the following hold: 


l. e-x =x forall x € X. 
2. g1- (82: x) = (81 è g2) - x for all g1, g2 € G and x € X. 


Let now k(t, t) be ann x n matrix of functions [k;;], i, j = 1, .. ., n continuous 
in the two variables t, r, with O < t < t and x a locally bounded, locally integrable 
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function in D’, (R, C”). That x is locally bounded means that it is bounded on every 
finite interval. We define the operation of J + k on x by 


(+k)-x:=x(t)+ I k(t, X(T) dt. 
0 


The resulting function is again a locally bounded, locally integrable function in 
D',(R, C”) as x and the elements J + k can be made to form a group. A suitable 
group multiplication can be found by writing 


t 


Q +k) [U + k2): x] =x) + fra, t)x(t) dt 


0 
t t Ti 


A / k(t, t)x(t) dt + if ARR i REMC T 
0 


0 0 


and noting that 


t 


t 
fre n) f ki tx) dr dt, 
0 0 


t t 
= J fre Ti)ka (T1, T2) dt x (T2) dt . 
0 n 


Since the inner integral on the right-hand side results in a matrix of continuous 
functions, we can define the group multiplication by 


(I+ki)o(I+k):=1I+k +k:+k xk 
with 
t 
kı x k(t, 1) = f bt DeO tda. (12.17) 


T 
For convenience we also put 


k x x(t) := k x x(t, 0) := fro, T)x(t)dt (12.18) 
0 


so that we can write 
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I+k)-x=x+krex. 


The unit of the group is readily seen to be J. 
It remains to show that every element of the group J + k has an inverse (J + k)7!. 
From the similarity with the geometric series we infer that the inverse is given by 


+O 1+ 0b" (12.19) 


n=1 


and show that this series converges in every interval0 < t < t < T. By definition, for 
every locally bounded function x = (x1, . . . , Xn) and every finite interval 0 < t < T 
we can find an upper bound given by 


Pr(x) := max { sup |x: (t)|} 
ISi<n Q<t<T 


so that, given the linearity of k, 


pr(k xx) < prtk) pr(x) T 


with 


n 


pr(k) := max > sup |k;j(t, T)| 


jal O<Tt<t<T 


Thus 
pr(k xk) < pr(k? T 


and by induction 
n—-1 


(n—1)! 


Pr(k™) < pr)" 


This upper bound is the nth term of a convergent series and implies the converges 
of (12.19) for every value of T. Having established convergence one immediately 
verifies that indeed 

A+k)jo(I—-k+k?F.---)=I 


and 
(I—=k+k? =: eT +e! 


With this group the Volterra equation (12.16) can be written as 
(I-—k)-y=x. 


and is solved by multiplying on the left with (J — k)~! 
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y(t) = x(t) + wx x(t) (12.20) 


[0,6] 
w:= X K”. (12.21) 


n=1 


The matrix function w is called the resolvent kernel of the equation. 

The group can’t be extended to a ring or an algebra with the natural addition as 
these would include the elements k. These elements pose two problems. First, the 
inverse of these elements are not necessarily functions. For example, the inverse 
of (t — t)"—!/(m — 1)! is DS and for singular distributions multiplication is only 
defined with functions in &. Second, such a ring includes zero divisors. From now on 
we will generally drop the symbols e of group multiplication and - of group operation 
as is commonly done with multiplication symbols. 

We now come back to the special case of (12.15) for which 


k(t, t) = A(t). 


The solution is given by (12.20) with 
t 


k” x (ono + fro ar) 


Th-1 


0 
t ti 
= f f [Ae Aminan yo 
0 0 


0 


Tn-1 


+f fof AAt f xa drana. 
0 0 


0 0 


These expressions can be written more compactly by introducing the notion of a 
time-ordered product of operators. We define T{A; (t1) -+ - An(Tn)} as the product 
with factors arranged from left to right in order of decreasing times. For example 


ATAT) TT] = TR 


T{A (ti) A2(t2)} = peeves a<n. 


With this meta-operator we can now write 


t 


r{(f aca) | =f | riada dnan 
0 0 0 
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-f | AA) dr dt + j / AA) dep de} 
0 Tt 
- J | aeae dtz dt, + / | A(t) A(t) dt, dt 
0 0 
t t 
= 2 | f A(t) A(t) dt. dt, . 
0 0 
and more generally 
t t Tn-1 
r{([ A(t) dr) | = nf E I A(t1) +++ A(t) dt,- dey (12.22) 
0 0 0 
because there are n! possible orderings of the n times T1, ..., Tn. Using these expres- 
sions we have 
td 
1 n 
«10 = <7 ( J A(r)dr)'| yo (12.23) 
0 
and 
t 
ee dt 
t Ay Ane 1 
=f J A(Ay)-- aoo fro dt da, 
0 0 
t t Ay 
= [| ic Í A(à1) <- A(Àn) dàn -dà x(t) dt 
1 n 
= —T{( laoa) Jax. (12.24) 
n! 
The solution of (12.15) can thus be written in the simple form 
y(t) = Eo(t)yo + E(t) x(t) (12.25) 
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with 7 
E,(t) =14(t—1)T {ef 4%} (12.26) 


the fundamental solution of the equation relative to t and where we have made 
explicit the fact that for t < T it is zero. 

In the special case in which A(.) commutes with i A(A) da the time ordering 
operator has no effect and the solution of the equation is a direct generalisation of 
the solution obtained using the method of separation of the variables for the scalar 
equation 

E,(t) = 1, — re AOA, 


In particular this is the case if A(.) is constant, in which case the fundamental solution 
becomes 
E,(t)=14(@t—r)e4*, = Aec™ 


and the expression for the solution y becomes a convolution identical to (8.11). 
For this particular case it is interesting to observe that, for a small-time increment 
At, the evolution from an initial state yọ can be approximated (to first order) by 


y(At) © (I + AAt) - yo 


so that, by iteration 
y(nAt) © (I + AAt)” - yo. 


Now if we set At = t/n we obtain that, in the limit as n tends to infinity 
f en A 
lim(7+A-) =e“. 
n>0oo n 


The fundamental solution of (12.14) given by (12.26) can also be interpreted as 
a matrix function of the two variables t and t 


W(t, t):= 140 —7)T fetaui] , 


As every element of the matrix is locally integrable, it is also a regular distribution 
that can be applied to test functions € D(R?). In particular, we can choose test 
functions of the form y(t)x;(t) with y, x; € D(R), j = 1,...n in which case we 
obtain 


J [we t)W(t)x(t) dt dt = [ [we t)W(t) dt x(t) dt 


—OO —0O —OO —0O 
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with x = (x1, .. . , Xn). The inner integral on the right-hand side evaluates to a matrix 
of indefinitely differentiable functions in & [16, 35]. For this reason and remembering 
that every distribution f is the limit of a sequence of indefinitely differentiable 
functions (for example fm = f * Pm with Sm the test functions of Example (2.4)) we 
can extend W x x by continuity to operate on vector valued distributions in &' (R, C”) 
by defining it as the distribution satisfying the system of equations 


n 


[0,6] 

((W xx), Y) Da T wij(t, ovod), ANS (a (12.27) 
j=l aes 

The thus extended linear map Wx is a distribution valued continuous function 


Wx: E' (R, C”) > D' (R, C”). 


With this definition we obtain for example that the solution of the equation with 
an input signal 
x = yoô(t — to), yo € C” 


((W + y08(t = 10)),. =D f wea 
j=1 % 


or 
W x yod(t — to) = W (t, to) yo - 


This shows that the matrix W plays a similar role as the fundamental solution E+ (t) 
and is called the (two-sided) fundamental kernel (or elementary kernel) of the differ- 
ential operator D — A(t). Italso shows that, as with LTI systems, the initial conditions 
can be absorbed in the input vector signal x. 


Example 12.1: Oscillator with Increasing Resonance [33] 
Consider an ideal oscillator with a resonance frequency increasing with the square 
root of time 

D'y +oity=x (12.28) 
to which we apply an input signal 


x = yoDô + yıô 


corresponding to initial conditions y(0) = yo and y'(0) = yı. 
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The equation can be rewritten in the state-space form by defining the state 


a [b] 


Du = A(t)u + Bô, y = Cu 


to obtain 


with 
0 1 yo 
A(t) = Ee 4 , B= >| >, C=[10]. 


In essence we need to calculate W (t, 0). Using (12.23) and remembering (12.22) 
we have, for n even 


_ r/o 0 
7 
-—1 Ak 

A(t)" x14) = tO Ra | even 
Tez bx 
and for n odd 
0 PO-D/24 on! 
xn "T by 
A(T)" x14(t) = pPO+D/2-1 i+ T= bx ; n odd 
Trai a 0 


where (a, )x>1 and (b;),>1 are the following sequences of integers 


(ag )e>1 = (2, 3, 5, 6, 8, 9, 11, 12,...) 
(by )e>1 := (3, 4, 6, 7, 9, 10, 12, 13,...). 


The fundamental kernel at (t, 0) is thus 
(oe) 
W(t,0) =1+) A0)” #14.) 
n=1 
ot ottó okt’ 
-| =e Fey ea |: 
o 0 casts ard hae 0 ET 


The series can be recognised as linear combinations of the Airy Ai and Bi functions 
and their derivatives Ai’ and Bi’ 


W(t, 0) = [wo(t) wi(t)] 
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Fig. 12.1 Solutions of 1 
(12.28) for yo = 1, y, = 0 
and wo = 27 0.8 
0.6 
0.4 
z 0.2 
> 


with 
PETE 316r (2/3) | (V/3Ai(—tag/*) + Bilton) 
DT 2 -w P (VAi (— to) + Bil(— to )) 
and l i ag 
r(1/3) 3AI(— tog )— tilts tay”) 
wilt) = 53075 F 2/3 2/3 
2-3 er ee tol) 


The signal of interest y is thus given by 
y(t) = CW(t,0)B. 
Specifically, for yọ = 1 and yı = 0 (see Fig. 12.1) 


3!/6r (2/3) 
2 


y(t) = (VBAC to) + Bi(—twy*)) . 


The full fundamental kernel W (t, t) can be obtained using Eqs. (12.9) and (12.12) 
and computing the inverse of W (t, 0) 


W(t, t) = 14.(t — tT) W(t, OLW (rt, 0)] | 
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12.2.3 Perturbation Theory 


The solution of (12.14) presented above is of great theoretical value. However, when 
it comes to solving practical problems it is in general very difficult to find a closed 
form for the fundamental kernel W(t, t). In many situations the problem at hand 
looks similar to a solvable problem, but with additional terms. If those terms are 
small in comparison to the ones of the solvable problem then one can obtain a good 
approximation to the solution of the problem by the following perturbative method. 
Suppose that the matrix A(.) can be split in two parts: one that leads to a solvable 
problem and that we denote by Ao(.) and one with relatively small elements, the 
perturbation term, that makes the equation unsolvable and that we denote by A(.) 


Dy =[Ao(t) + AM]y +x. 


Let Wo(t, T) be the fundamental kernel of the solvable part of the equation, Yo(.) its 
principal fundamental matrix, that is, the solution of the matrix equation 


DYo(t) = Ao(t)Yo(t), = Yo(O) = J. 
and let express y in terms of a new vector y defined by 
y=Yo(t)y. 
Then the equation becomes 
DYo(t)¥ + Yo(t)D¥ = (AoW) + AOIS + x 


which reduces to 
D = Q05 + Yg (Ox 


with y 
O(t) = Yy (HAH) Yo(t). 


This equation has the same form as the original one. Its solution is therefore given 
by 
sh =1,0- tyr fel aay [OO]. 


The advantage that we gain is the fact that, if the elements of A(.) are small, then 
the series expansion of this solution converges very quickly. Differently from this, 
to obtain a good approximation using the series of the original formulation of the 
problem requires a large number of terms (compare with Example 12.1). 

If x is composed by regular distributions then the first terms of the solution of the 
equation are given by 
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y(t) = Yo(t) i Yo(a)x(a) da 
0 


+r f f acy Odd 
0 0 


t 


= | wensaas | f wor t)A(t)Wo(t, A)x(A) dA dt +---. 
0 0 0 


The first term that we denote by yo is the solution of the unperturbed equation. In 
general, it is given by 
yo = Woxx. 


The next term is the first order perturbation term that we denote by yı. Note that 
it can be expressed as the action of the unperturbed system on an input signal x, 
constructed by multiplying yo by the perturbation A 


y= Worx, x(t) = A@yo(t). 


Similarly, the nth order perturbation term can be represented as the action of the 
unperturbed system on an input signal obtained by multiplying the perturbation term 
of order n — 1 by A 


Yn = Wox Xn,  Xn(t) = A(t)yn-1 2) - 


The output of the system 


y(t) = >> yn) 
n=0 


can thus be calculated iteratively starting from the response of the unperturbed system 
where each successive term is the result of multiplying the output of the previous 
term by A and feeding it back as input of the unperturbed system. This reminds of 
a feedback system with the unperturbed system playing the role of the forward path 
and A of the feedback one. 


12.2.4 Non-smooth Coefficients 


For several applications the requirement of differential operators with indefinitely 
differentiable coefficients is too restrictive. In those situations it’s useful to work in 
the subspace of D’ constituted by distributions that are m times differentiable and 
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denoted by D”. These distributions are said to be of order m and are the continuous 
linear functionals on the set of m times continuously differentiable functions with 
compact support D”. Convergence is defined in a similar way as for distributions 
in D’. 

Given a distribution f € D™ the product of f with an m times continuously 
differentiable function g is well-defined 


(fg, p) = (f, 8) 


since if o € D” then g¢@ is also in D”. Note that we can exchange the roles of f and 
g and still obtain a well-defined multiplication. Thus if f is an m times continuously 
differentiable function, it can be multiplied by a distribution of order m. 


Example 12.2: Dirac Distribution 


The Dirac distribution ô belongs to D° and its multiplication with continuous func- 
tions is well-defined as long as one restricts considerations to D”. 


12.3 Impulse Response Generalisation 


In the previous section we saw that differential equations describing LTV systems 
aren’t convolution equations. In spite of this we found that the solution of the equation 
can be written with the help of the operator x acting on a (matrix) function charac- 
terising the system (the fundamental kernel) and the input vector x. In particular, for 
a system described by the state-space representation (12.3)-(12.4) with D(t) = 0, 
the input-output characteristic is given by 


y(t) = C(t)14.(t — TT fek 4%) & B(t)x(t) 
= C1, — T fel 4%) B(T) x x(t). 


This expression highlights how in LTV systems the operator x is the natural operator 
taking the place of convolution in LTI systems. However, because the Fourier- and 
Laplace-transform convert convolutions into products and because for many pur- 
poses the frequency domain characteristics of a system are more interesting than the 
time domain ones, in engineering circles it is common to express the input-output 
characteristic of LTV systems in terms of a convolution like operator as we did in 
Sect. 12.1. This is easily done by the change of variable 


E=t-T 
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and by defining the time-varying impulse response A(t, £) as a function of the vari- 
ables ¢ and £ 


t 


y(t) = [rer-oxeras, 


0 


ht, E) =COWET fel 4} Be - g) 


where we have assumed x to be a regular distribution in D’,. Note that, while the 
above integral looks very similar to a convolution, it differs from the convolution 
that we defined in Sect. 3.2. A generalisation of the above convolution like operation 
for LTV systems is obtained by adapting (12.27) and defining it as the distribution 
satisfying the following equality 


(h *: x, $) = (x(), (h(t, t — £), 6(@)) (12.29) 


where we have generalised the inner integral of (12.27) to the application of a parame- 
terised distribution to the test function @. Since this operation shares several properties 
with convolution, the operator *, is called the convolution product for time-varying 
systems. In the technical literature it is most often called convolution and denoted by 
the same symbol as the one used for convolution. In the following we will also often 
simply call it convolution, but maintain the use of the symbol x; to make it clear that 
it is not the operation defined by (3.6). 

In the special case in which the time-varying impulse response is the product of 
an indefinitely differentiable function f and a distribution g 


A(t, £) = fs) 


the convolution product for time-varying systems can be expressed in terms of a 
proper convolution by 


(h x: x, $) = (g * x, fo). 


In the previous section we discussed the fact that, for systems described by a 
differential equation, the application (h(t, t — £), ¢ (t)}) appearing on the right-hand 
side of (12.29), regarded as a function of the parameter £, is a function belonging 
to &. For this reason, for the equation to have a meaning, x must be restricted to 
distributions in &’. However, if we define a function y € & bounded from the left 
with y(t) = 1 ina neighbourhood of [0, o0) and assume A to be such that 


E > y(E)(ht, t — £), 6) 


is a Schwartz function for every ¢ € S, then (12.29) remains valid for right-sided 
tempered distributions 
xEeS NAD. 
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Note the similarity with the definition of the Laplace transform and the fact that, as 
for the Laplace transform, the value of the distribution does not depend on the choice 
of y. For this reason and as is commonly done for the Laplace-transform, we will 
generally not write y explicitly. 

Before concluding this section we note some properties of the operator *,. The 
first is that it is associative 


(hp *; ha) * x = hp *; (ha *; x) 


with A4 and hg the time-varying impulse responses of two systems. This is a direct 
consequence of the fact that «, is related to x by a simple variable transformation and 
by the definition of the latter (see Eqs. (12.17) and (12.18)). 

A second important property, or rather the lack of it, is that, x; is not commutative. 
Therefore, differently from LTI systems, the order of LTV systems is important. As 
an example consider the cascade of a low-pass filter with a 3 dB cut-off frequency of 
3ap and the frequency shifting system of Example 12.4 with wo >> @3ag. Suppose 
that the system is driven by a signal with a frequency falling in the pass-band of 
the LPF. Then if the signal passes first through the LPF and then into the frequency 
shifting system, the output will have a large magnitude. Differently from this, if the 
input signal first passes through the frequency translating system then the signal at 
the input of the LPF will lie in the stop-band of the latter and will appear much 
attenuated at its output. 


12.4 Time-Varying Frequency Response 


12.4.1 Definition 


Consider a system described by the time-varying impulse response h(t, £). Under the 
assumption that the input signal x is a right-sided tempered distribution the system 
response y can be written as 


(y(t), $) = (FHF), J har -O60 di) 


-koz f [ rer- 90 are as) 


—00 —0O 


-koz f frer- pe agor bar) 


=00 =00 
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= (o, x J haoc" gaa) 


= (she w) el x x(t), 90) 
27 


or 
1a 
y(t) = — hlt, w) e * X(t) (12.30) 
27 


where h(t, @) is the Fourier transform with respect to £ of h(t, €) and is called the 
time-varying frequency response of the system 


[0,9] 


A(t, o) := f h(t, €)e dé. (12.31) 


=ù 


In particular, for regular distributions we have 


CO 
1 A 
y(t) = — f h(t, w)X(@) e” da. 
20 
—0o 


It’s easy to check that for real systems the time-varying frequency response at —w is 
equal to the conjugate complex of the value at w 


A(t, —o) = A(t, w) 


for each value of t. 

To obtain a physical interpretation for A(t, w) we apply a complex tone e7% 
as input signal. This is allowed because periodic distributions are isomorphic to 
distributions with compact support (see Sect. 3.4). With this input signal the output 
of the system is found with the help of (12.30) to be 


y(t) = h(t, wpe 


and suggests the interpretation for the time-varying frequency response A(t, @) as 
the complex envelope at wo of the output signal (see Fig. 12.2). 

If the output signal y is a tempered distribution it can be Fourier transformed. A 
useful expression relating the spectrum of y and the one of the input signal x can be 
obtained by expressing y with the help of (12.30) 
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x(t) = R{ele"} y(t) = R{AG, we!" } 


=~ 


(t, w) IAPA) ar) 


N 
y|- 


A(t, w)e Y dt o(w) au) 


N 
y|- 


Âw —w, w) p(w) dw) 


| oo 
(arf 
(af 
| 


= Laie —W,W)*X(w), ou] 
20 
or 1 a 
3(w) = —h(w — vw, w) x ê (w) (12.32) 
27 
with = 
en w) := I h(t, œe" dt. (12.33) 


The function Å is the two-dimensional Fourier transform of the time-varying impulse 
response / and, in the context of communication systems, is called the doppler-spread 
function. Equation (12.32) shows that the input and output spectra of an LTV system 
are related by a convolution like operation. In particular, for regular distributions 
they are related by the following integral 


3(w) = a | hwo, ooa. 
27 


For tempered distributions the time-varying impulse response A, the time-varying 


frequency response h and the doppler-spread function hare isomorphic to each other. 
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For this reason an LTV system with a tempered time-varying impulse response can 
be described by any of these functions. 


Example 12.3 


In this example we investigate the relationship between an LTI system to which we 
apply a right-sided input tone and an LTV system activated at t = Os and driven by 
a tone, resulting in equal output signals. 

Consider an LTI system described by the differential equation 


Dy+ay=x, a>0O 


to which we apply the signal x(t) = 1, (t)e/®’ € D!,. The response of the system 
can be calculated with the help of the Laplace transform. The transfer function of 
the system and the Laplace transformed of the input signals are 


1 
EO) a Ris} = =a 
and i 

X (s) = ——_., R{s} > 0 


S— Jo 


respectively. The system response is thus found by inverse Laplace transforming 


1 
Y(s) = H(s)X(s) = GO —jo)’ R{s} > 0 


which gives 


e7% 
y(t) = = a 1). 
a+ Jo 


We now re-interpret the system as a time-variable one consisting of the above LTT 
system and an ideal switch at its input. For £ < 0 the input is disconnected from the 
system (switch open) which therefore produces the constant output signal y(t) = 0. 
At t = 0 the input signal is connected to the input of the LTI system by closing the 
switch. The full system is therefore described by the differential equation 


Dy + ay = 14(t)x. 
The input signal is now the complex tone x(t) = e/®". 
To obtain the system response we first compute the time evolution operator U (t, t) 


which is the solution of 


Dy +ay = ô(t — 7T), t>t>0 
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and given by 
U(t, t) =14@)14¢-—ne%*™. 
With it the response of the system to the input x(t) = e/® is calculated to be 


t 


y(t) = U(t, t) x x(t) = eas 
0 


_ ew (ecto — 1) 
a+jo 


which of course agrees with the calculation through Laplace transform. However, 
with the new interpretation we see that the system posses a time-varying fre- 
quency response h(t, œw). The easiest way to calculate it is through the relation 
y(t) = h(t, we! and we obtain 


e Uto) 
(erron = 1) , 
a+ jo 


This shows the relationship between h (t, œ) and the LTI frequency response H (jw) = 
1/(a + Jæ). Differently from the latter, h(t, œw) includes the full information about 
the variation in time of the system. In this particular example, about when the switch 
is closed. 


h(t, œ) = 


Example 12.4: Frequency Translation 
Consider a system described by the doppler-spread function 
h(w, w) = 2x8(w — wo). 
According to (12.32) the spectrum of the output signal is given by 
(w) = d(w — wo — w) x X(w) = X(w — wo). 

Therefore, the effect of the system described by the above doppler-spread function is 
to shift in frequency the spectrum of the input signal by wo. Such a device is referred 
to as a mixer. 

The time-varying frequency response and the time-varying impulse response cor- 


responding to this delay-spread function are easily calculated to be 


A(t, w) = et 
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e/ wot 


x(t) y(t) 


Fig. 12.3 Block diagram of a frequency-translating LTV system 
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Fig. 12.4 Ideal sample and hold 


and 


A(t, E) = e/""5(E) 


respectively. If we apply a complex tone e/®” as input signal we can calculate the 
output signal from the former and (12.30) as 


y(t) = Letro * 27 5(t — wy) = eJ (wotao)t 
T 


or from the latter and (12.29) as 
y(t) = S (E) x, el Oot — e7 (wWo+oo)t 


In both cases the angular frequency of the input tone is shifted by wo as expected. 

The time-varying impulse response shows clearly that the system is memory-less, 
that is, the value of the output signal at time t only depends on the input signal at 
time t. The effect of the system is to simply multiply the input signal by the complex 
tone e/” as illustrated in Fig. 12.3. 


Example 12.5: Sample and Hold 


In this example we consider an ideal sample and hold: the output of the system is 
constructed by sampling the input signal x at regular intervals 7 and by holding the 
value of each sample constant for the duration of a period 7. Sample and hold blocks 
are used for example at the input of analog-to-digital converters (ADC) to give the 
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converter enough time to compare the value of a sample with one or more reference 
signal levels. The operation of a sample and hold is illustrated in Fig. 12.4. 
The ideal sample and hold is characterised by the following time-varying impulse 


response 
oo 


h(t, £) = ôr -Er = $ 80- E- nT) Ir 8) 


n=—oo 


with 
10<é§é<7T 
1 = T 
7G) 0 otherwise. 
Note that in this case (12.29) doesn’t make sense as h is a singular distribution and 
in the right-hand side expression x is not applied to a smooth function. To give a 
meaning to 


y(t) = h(t, E) x x(t) 


we have to restrict the input signal x to belong to &. Then we can write 


(y, $) = (A(t, E) * x(t), PO) 


5 EFT 


"a (KOE —a7), J boai) 
Ẹ 


n=— 00 


(oe) 


= } xaT)(Ir(t —2T), oO) 
or 


(oe) 


y(t) = Yo xMT)i-(t— nT) 


n=—oOo 


and we obtain the desired system response. The system response can also be written 
as a (proper) convolution 


1 
y(t) =T dr (t)x(t) * gir). 
From this expression, assuming x to be Fourier transformable, it’s easy to compute 
the output spectrum. From (4.14) we read that the Fourier transform of T ôy x is the 
convolution of the transforms of the factors divided by 27r 


FIT êr x} = ĝo, * 2 


with œw; the sampling angular frequency 27/7. Thus, the output spectrum is 
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1, 
$w) = [bo * îl) 17) 


with 


v 


Îr lo) = T —— 2.7 


v 
Ws 


This expression shows clearly the effects of sampling and of holding in the frequency 
domain. The operation of sampling is represented by the factor in square brackets. Its 
effect is to produce an infinite number of copies of the spectrum of the input signal 
shifted by multiples of ws 


(oe) 


ôo, *X(W) = X(@ — nas). 


n=— 00 


If the original signal has to be recovered from the samples then one must avoid (or 
reduce to negligible levels) overlapping between the copies. This amount to saying 
that the power of the input signal residing outside the frequency range (—@, /2, ws /2) 
must be negligible. Or, in other words, the sampling frequency must be at least twice 
the frequency of the highest component of the input signal spectrum containing 
a non-negligible amount of power. This is the statement of the famous sampling 
theorem. If this condition is satisfied then the input signal can be recovered with 
the help of a low-pass-filter eliminating the copies with n 4 0. When the copies of 
the input signal do overlap one says that sampling causes aliasing. Note that, if the 
spectrum of the input signal x only occupies a small fraction of the frequency range 
(—@,/2, @;/2) then one may find a sampling frequency lower than w, not causing 
aliasing. 

The effect of holding act as an LTI filter introducing a delay of 7/2. The filter 
has a low-pass characteristic with notches at multiples of w,. The effects of sampling 
and of holding on the spectrum on a signal are illustrated in Fig. 12.5. 

The need to restrict x to being an indefinitely differentiable function may seem like 
excess of rigor. Note however that if x is not continuous at the sample instants n7 then 
the problem is not “merely” a mathematical one, but any physical implementation 
will fail to work properly. This is so because if the input signal varies very rapidly 
compared to the actual speed of the physical sampling switch, then the value of the 
sample will be affected by many implementation details and in particular by noise. 
The result is a system producing unpredictable sample values. 

From a mathematical point of view one may enlarge the type of allowed input 
signals to the class of continuous functions. Then the system response is mathemat- 
ically well-defined, but it’s not a distribution anymore. In fact, the value of a Dirac 
impulse is defined as the value of the test function at zero. If we multiply the test 
function with a continuous function, the value is still well-defined. However, we 
can’t expect to be able to compute the derivatives of the output signal. Compare also 
with Sect. 12.2.4. 
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Fig. 12.5 Illustration of the 0 - 
effect of sampling and of a campled 
sample and hold on the = output 
spectrum of a signal -10 
-15 
S -20 
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We started this section by performing a calculation leading to the definition of the 
time-varying frequency response of a system and a relation expressing the output of 
the system in terms of it. If we assume Laplace transformable, right-sided signals 
and redo a similar calculation replacing the Fourier transform by the Laplace one we 
obtain the time-varying transfer function of the system 


(oe) 


H(t, s) = fhe pe dé Ms}>o. (12.34) 
0 


With it the output of the system is given by 


y(t) = | pq, s)e” x X (s). (12.35) 
2 J 


12.4.2 Differential Equation 


Consider again a linear time-varying system whose state u is described by the system 
of differential equations 

Du = A(t)u + B(t)x 
and assume that it is driven by a complex tone 


x(t) =e”. 


From (12.30) we know that the components of the state u can be represented by 
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uilt) = aj (t, we’ , E= hesti 
Inserting this representation for u and the complex tone for x in the equation we 
obtain 
Dii(t, we” = AHA, we” + BMV . 
The left-hand side can be written as 
Dû(t, we! = 1” (Ja + D)û(t, w) 
so that we obtain an equation for u(t, œ) 


(Jæ + D)i(t, w) = Aut, w) + BY). (12.36) 


With a(t, œ) we can directly obtain the time-varying frequency response of the system 
without having to first compute the fundamental kernel 


h(t, o) = C(Na(t, o) + Dit). 


In particular, if the system is described by a (possibly) higher-order differential 
equation 
Lit, D)y = N(t, D)x 


with 


L(t, D) = D” + am- (t) D"! +--+ + ao(t), 
N(t, D) = b (t) D" + bp (t)D" | + -- -+ bolt) 


we can directly obtain an equation for the time-varying frequency response of the 
system by replacing the differential operator D in L by the operator Jæ + D and in 
N by Jæ [36] T 

L(t, jæ + D)h(t, w) = N(t, Jæ). (12.37) 


Note that this formulation in terms of distributions and distributional derivatives 
takes care of the initial conditions automatically. If one works with functions and the 


standard derivative then the initial conditions for the problem are obtained from 


y(t) = h(t, we’. 
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Example 12.6 
Consider a system that is switched off up to time t = 0 (y(t) = 0, t < 0) at which 
point it is turned on and is then described by the differential equation 


Dy+ty=x. 


We are interested in the time-varying frequency response of the system. We compute 
it in three different ways. 

First we compute it via the time evolution operator U. For t > t > 0 it is found 
by solving the differential equation 


Dy+ty =d(t—T). 
As can be verified by inserting it into the equation, it is given by 
U(t, T) = ent [240 /2 i 


To obtain the time-varying frequency response we apply the input x(t) = e7®* and 


obtain 
t t 


y(t) = f ve. T)x(t)dt Se] ae 


0 0 


From this and 7 
y(t) = h(t, we! 


we deduce that 
t 


a ea) ya 2 
h(t,w) =e t? /2 le [24JoT dT. 
0 


The time-varying frequency response of the system can also be obtained by Fourier 
transforming the time-varying impulse response. The latter is obtained from the time 
evolution operator using the variable substitution £ = t — t 


A(t, E) = 14. (€)14 (Net 74"? 


where we made explicit that for t < O the response of the system vanishes. The 
time-varying frequency response is thus 
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°S t 
h(t, o) = f repetas = id ode 
—00 


t t 
t? 2 = pps 2 
=e t Pye [26 Jott Ddr =~ t? /2. we lies [2+jot qr 
0 0 


which matches the one obtained with the previous method. 
A third method to compute the time-varying frequency response is by solving the 
corresponding differential equation 


(D+ jo)h + th =14(0). 


The solution is 
t 


A(t, w) = fe Pe] eae, 
0 


as is verified by inserting it in the equation and where we made explicit that fort < 0 
it is zero. 


12.5 Linear Periodically Time-Varying Systems 
12.5.1 Floquet Theory 


In this section we consider in more details linear periodically time-varying (LPTV) 
systems. In particular, we study systems that can be described by a state-space repre- 
sentation with matrices A(.), B(.), C(.) and D(.) having periodic smooth functions 
as elements. These include systems described by differential equations with periodic, 
indefinitely differentiable coefficients. 

Consider the differential equation 


y = A(t)y + B(t)x (12.38) 
with A(.) an n x n-matrix and B(.) ann x 1 one, both with 7 -periodic indefinitely 
differentiable elements and where, for brevity, we denote by y the (distributional) 


derivative of y and similarly for other quantities. Let further Y(.) be the principal 
fundamental matrix of the equation and 


U(t,t) =Y(t)Y \(t) 
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the evolution operator. From the periodicity of A(.) we obtain 


VET EF) = 9t +MY OD) 
=A(t+T)Vt+T)Y T) 
=A(HUt+T,T) 
from which, with U (t, t) = I and the uniqueness of the solution of the equation we 


deduce 
Ut+7,7) =U, 0) 


and 
Yt+T)=VYOYT). 


Let now introduce 
P(t)=Y(e"*, Fec 


with F ann x n matrix with constant coefficients and define the variable transfor- 
mation 
y(t) = P(t)z(). 


In terms of z the equation becomes 
P(t)z + PO = A(t)P(t)z + B(t)x 


or 
z= P| (NAP) — PO) + POBO. 


To simplify this equation we calculate the derivative of P 


P(t) =Y(the* —Ye"" F 
= AHY (He * —Y(the F 
= A(t)P(t)— P(t)F. 


Using this result in the previous expression we finally obtain 
ż = Fz + P (t)B(t)x. 


This equation is similar to the original one, but with the important difference that 
the periodically time-varying matrix A(.) of the original equation has been replaced 
by a constant matrix F. This shows that the evolution operator of any system of 
differential equations with A(.) a T -periodic smooth matrix can be represented in 
the form 

U(t, t) = P(e P7! (t), (12.39) 
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This is called the Floquet representation of the evolution operator. 
Let yo € C”, from the analysis of LTI-systems we know that ef yo is a linear 
combination of functions of the form 


pits 
with A; an eigenvalue of F and p; a polynomial of degree lower than the algebraic 


multiplicity of 4;. The Floquet representation tells us that the solution of (12.38) is 
a linear combination of functions of the form 


piee 
where p; are again polynomials, but in this case with J -periodic smooth coefficients. 


Example 12.7 


In this example we look for the solution of the equation 


Dy = A(t)y +x 
with 
_ | @3aB + Aw COS Ont 1 
i = | 0 wzag + Aw cos ont , 


In particular we are interested in the evolution operator of the equation as it allows 
us to calculate the solution for an arbitrary input x. 
First observe that A(.) can be written as a sum of two matrices 


_ |@3ap l Aw COS Omt 0 
A= | 0 ie a | 0 E i 


the first of which is constant, and we denote it by F. To find the principal fundamental 
matrix we make the ansatz 


Y = PH, P(t)=p(t)l 


with p an indefinitely differentiable periodic function with period 27x /w,,. Inserting 
this ansatz in the equation we find 


DY = D| pIe" ] 
= pIe + pI Fe" 


È 
=[F + 4r]. 
[F+ oro 


From this expression we see that it satisfies the equation if 
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È = Aw COS Wnt. 


The function p can be calculated from this equation and the condition Y (0) = I 
using the method of the separation of variables from which we obtain 


p(t) = aoe Sin mt 
The principal fundamental matrix is thus 
42 Sin @mt tF 
Y(t) = em iad aan 


With Y and using the results of Example 8.2 for e’” the evolution operator is found 


to be 7 
52 sin wmt 
er el 
, Ao : 


eon Sin mT 0 1 


12.5.2 Time-Varying Frequency Response 


Consider a SISO linear periodically time-varying system described by the state-space 
representation 


Du = A(t)u+ B(t)x (12.40) 
y = C(t)u + D(t)x (12.41) 
with A(.), B(.), C(.) and D(.) indefinitely differentiable 7 -periodic matrix func- 
tions. Thanks to linearity we can analyse the response of the system for D(t) = 0 
and add the contribution of D(t)x at the end. 
In the previous section we established that the evolution operator of (12.40) can 
be expressed in the form 


U(t, t) = P (tjet 7P" P7! (t) 


with P (t) an invertible, indefinitely differentiable 7 -periodic matrix function and F 
a constant matrix. Using this representation for the response of the system we obtain 


y(t) = 14t — CHAP HeT P T)BE) & x(t) 
or, in terms of the time-varying impulse response 


y(t) = hc * x(t) 
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Fig. 12.6 Representation of 
a stable LPTV system 
| 


|> Hls) 


x(t) ——=— > Hols) SQ + = y(t) 


|> H(s) m 


with 
helt, £) = EC HOP Hef P(t — EB — £). 


If we now add the contribution to the output from D(t)x we finally find 
y(t) = h *, x(t) 


with 
h(t, €) = hc (t, E) + DESE). 


The fist term Ac is a regular distribution growing at most exponentially with respect 
to € while the second has bounded support. The impulse response A is therefore 
Laplace transformable with respect to £. This implies that the system possess a time- 
varying transfer function H (t, s). H(t, s) is a function in the variables t and s and 
the above expression makes it clear that it is periodic in t. Therefore, with respect to 
t, we can expand it in a Fourier series 


H(t,s) = a H,,(s)e!"°7" 


n=—OOo 


with wy = 2x/T and H,,(s) functions of the variable s alone. 

The last expression shows that LPTV systems can be regarded as the parallel 
connection of LTI subsystems with transfer functions H, whose outputs are shifted 
in frequency by nw (see Fig. 12.6 and Example 12.4). This is best seen by applying 
a complex tone to a stable system. Thus, assume that all the eigenvalues of F have a 
negative real part, then the time-varying frequency response h(t, w) does also exist 
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and is also a regular distribution that can be identified with a function in the variables 
t and w. Proceeding as above we can write it as 


[0,6] 
Ált, w) = 5 fn loje” 


n=— 00 


with Ån (w) = H, (Jæ). If we now apply a complex input tone e7®% to the system and 
use (12.30) to calculate the system response we obtain 


[0,0] 
=) Lee, 


n=—C 


The output is thus seen to consist of a sum of tones at wy + na, n € Z, each one 
weighted by h,,(wo). It is readily seen that for a real system the following relation 
must hold 


Example 12.8: LPTV LPF 


In this example we examine a series RC low-pass filter (LPF) where, to reduce 
the physical area occupied by the circuit, the series resistor is implemented with a 
MOSFET. While this will produce some distortion, here we are interested in what 
happens if the gate bias voltage is disturbed by a periodic signal (see Fig. 12.7). 
This could happen for example if in a mixed-signal system (both analog and digital 
signals present) a line distributing the system clock is in proximity of the gate bias 
line, and the two are not properly isolated. The system is described by the following 
differential equation 


D t = t t ; t) = —_—_ 
[D + wsaz(t)|y = wag Ax (t) w3aB(t) ROC 
with y the voltage across the capacitor, x the source voltage and R(.) a periodic 
function. Given the periodicity of R(.), @3ag(.) is also a periodic function with the 
same period that we assume to be smooth. w3¢,(.) can therefore be expanded in a 


Fig. 12.7 RC low-pass filter O Vpias + Vint) 
with a PTV resistor | Dias int 


AO etp 


12.5 Linear Periodically Time-Varying Systems 291 
Fourier series that, for simplicity of analysis, we assume to be given by 
@zag (t) = wo + A@COS(@mt), w0, AW, Om > O 


with Aw & wo. We are interested in characterising the frequency response of the 
filter. 

The equation describing the system separates into a differential equation with 
constant coefficients and a small perturbation term 


(D + w) y + Av coslont)y = osag (x(t). 


We can therefore solve the problem using the perturbation theory that we developed 
in Sect. 12.2.3. In addition, instead of solving for the time-varying impulse response 
and obtain the time-varying frequency response by Fourier transformation, it is con- 
venient to solve directly the equation for the latter. Proceeding as in Sect. 12.4.2 we 
obtain 

(D +jJøo+ o)hG, w) + Aw COS(Mmt A(t, w) = zag (t) 


and we can identify — Aw cos(@,,t) with the perturbation term A and —(J@ + wo) 
with the matrix Ag of the unperturbed system. 

We start by computing the time-varying frequency response of the unperturbed 
system that we denote by ho(t, œ) and which has to satisfy 


(D + Jo+ wo)ho(t, w) = w + Siem! + enon) 


where we have represented cos w,,t by complex tones. Note that the variation in R(.) 
results in additional input tones to an otherwise time invariant system. The solution 
of the equation is readily calculated to be 


N Aw Wmt —JOp_t 
holt, w) = H(w) + zg O + ede’ m+ H(w— Ome!" 
w 


with 


H (œw) = 


the frequency response of the RC filter without disturbances (that is for w3ag (t) = 
wo). H (w) /wo plays the role of the fundamental kernel Wo of Sect. 12.2.3. However, 
because Ag is time invariant we can work in the convolution algebra of periodic dis- 
tributions and instead of the fundamental kernel, the system can be characterised by 
the fundamental solution of the equation. In this example the kth Fourier coefficient 
of the fundamental solution of the equation is given by (see Example 7.5) 
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_ Hw + køn), 
~ T wo ? 


ek 


T =2r/Om. 


We now calculate the first order perturbation term. The first step consists in cal- 
culating the new “input signal” produced by the perturbation A 


as A A A 
x(t) = A(Nho(t, w) = a + e5) holt, œ). 


The first order perturbation term of the frequency response Í (t, œ) is then obtained 
by applying this signal to the unperturbed system 


A A A 
(D + Jo + wo) hy (t, w) = -a + enV holt, w) . 
The solution of the equation is given by 
D Aw Omt =JOmt 
h\(t,@) = -z TOHO + Wm)e!?" + H(@ — wm)e A ] 
wo 


Ao \2 

20 
+ H (w + Om) H(@ + 2@m)e!?"" 
Sa on) Ho — Boneia] ; 


Note that both Áo and Á; include terms of order Aw. Since A is proportional to Aw 
and all terms of /, are proportional to powers of this quantity, no higher perturbation 
term will include a contribution of order Aw. The first two terms fio and hy are 
therefore enough to establish the effects of the perturbation of order Aw. To obtain 
an estimate to second order in Aw we would need to calculate ho as well. 

With these results the first order response of the system when driven by a tone at 
w is given by . f 

y(t) = [hot, w) + hit, oe”. 


It is comprised by tones at w + næm, n = —2, —1, 0, 1, 2. It’s not difficult to see that 

if we would calculate higher order terms we would obtain similar tones for larger 

values of |n| and in the limit, when including all perturbation terms, for all n € Z. 
Let’s consider more closely the component at w + @m 


A 
yı) = 2 HH + @m)[1 = H(o) |e! nto" 
2a 


Aw Ia 
— H (w + om) 2o - es @mtoyt 
1+ 5 


~ 2a 
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and assume that wm >> wo. If the filter is part of a transmitter and used to suppress 
noise outside the channel allocated to the user or service, then a 27 /@,,-periodic 
perturbation is seen to create spurious emissions that can fall in frequency ranges 
reserved for other users or services and violate the maximum allowed emission levels. 
From the above expression we note that a wide nominal filter bandwidth wọ helps 
in reducing the emission level caused by the perturbation. This can be interpreted 
intuitively as follows. If the input signal frequency is much smaller than the 3 dB 
cut-off frequency of the filter, then it produces a very small current flowing through 
the filer components and, in the limit of zero current, the output signal doesn’t depend 
on the value of the filter components. 

If the input tone is well above the nominal 3 dB cut-off frequency of the filter 
|@| >> wo then |H(w)| < 1 and the output tone at w + w,, can be approximated by 


Aw 
y(t) & —H(w + ome! Onto” ” 
2w0 


If the frequency of the input tone is such that |w + @,,| < wo then the tone falls in a 
spurious pass band of the filter and for |w + @m| K wo it can be approximated by 


MORN Aw e Onto) 
2w0 


If the filter is part of a communication receiver responsible to suppress interfering 
signals (the channel filter) then we see that 27r /@,,-periodic perturbations introduce 
spurious responses in the stop band of the filter at multiples of w,, that down-convert 
interfering signals in band, possibly masking the wanted signal. The amplitude of 
the dominant spurious response is proportional to the perturbation magnitude Aw 
relative to the nominal 3 dB cut-off frequency of the filter. 


Example 12.9: Quadrature (De-)Modulator 


Consider the frequency translating system of Example 12.4 with time-varying 
impulse response 

Amoa(t, E) = ef S (E) . 
It is a complex system in the sense that if we apply a real valued input signal its 
response is complex valued. In this example we show that the system can be imple- 


mented using two real sub-systems. 
Let’s decompose the input signal into its real and imaginary parts 


x(t) =r(t)+ jq(t). 


The system response is given by 
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Y(t) = Amoa(t, E) * x) = FA + sq (err 


and can be written as 


[r(t) + sq (t)] cos wot — [q (t) — Jr (t)] sin wot . 


In this form the system response is seen to be the sum of the responses of two real 
systems driven by correlated signals (see Fig. 12.8). By linearity, if the two systems 
are driven by the real part only of the input signals, that is by r and q respectively, 
then the response of the system is 


y(t) = R{[r@ + Jae}. 


The combination of the two real systems is called a quadrature modulator. Each of 
the two real subsystems is called mixer and effectively multiply the input signal with 
a second real valued signal / called the local oscillator (LO) signal. A mixer can 
therefore be considered a system having two input ports. 

Consider now a system that shifts the spectrum of the input signal in the opposite 
direction 


haemoa(t, E) = eer ae) . 
We would like to find a real system implementation that when driven by the signal 
rA + sq@)er"" 


allows us to recover the signals used at the input of the quadrature modulator used 
to generate it. Such a system is readily found by observing that 


1 
[r(t) + Jq ]e COS Wot = [r(t) + J405 le $ 1] 


and similarly 


[r(t) + Ja (He (—1) sin wot = [r (t) + TO C -=1]. 


Thus, if the signals r and q are band-limited to frequencies smaller than wọ, the 
original signals can be recovered (up to a fixed scaling factor) by use of two mixers 
driven by quadrature (orthogonal) local oscillator signals followed by low-pass filters 
(see Fig. 12.9). Such a system is called a quadrature demodulator. By linearity, if the 
system is driven by the real signal 


RA + Jale} 
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COS Wot 


r(t) + jq(t) È 
q(t) — yr(t) a 


— sin wot 


‘ VO = [r@) + qe" 


Fig. 12.8 Quadrature modulator 


COS Wot 


—™—)4 BE | Hr) + ya] 


(<)> BS > Halt) -ro 


— sin wot 


[r(t) + joe 


Fig. 12.9 Quadrature demodulator 


Fig. 12.10 a Typical local a 
oscillator unipolar waveform 
b Typical local oscillator 
bipolar waveform 


the two output signals are the real parts of what we found above, that is r/2 and q /2 
respectively. 


Example 12.10: Harmonic-Reject Mixer 


We saw in Example 12.9 that a mixer is a system multiplying the input signal with 
a T -periodic signal called the local oscillator signal 


A(t, E) = 108E). 
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K =| A(t) — yı 


HOXH ne) Hyi 


i ir-7TN4) 


Y 
-09 = A) yn 


Fig. 12.11 Generic N-path receiver 


I(t) 


-60 =| h(t) ii = wo 
It-T) g a 
I | | 
Pee i 
—|— > wy/2 
| 
ut-T XA) — |> WN 241 
zo 
-60 = h(t) >| WN /2-1 


Fig. 12.12 Quadrature N-path demodulator 


In practical implementations, to minimise the signal-to-noise degradation caused by 
the circuit, the local oscillator signal is not a pure sinusoidal. Instead, it is most 
often designed to approach a rectangular waveform as depicted in Fig. 12.10b. Being 
periodic the signal / can be represented by a Fourier series 


(oe) 


I(t) = > ane, wr = 2n/T 


n=—OoO 
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a=] 


for the waveform in Fig. 12.10a and 


0 n even 
an = 2 T dd 
= sin(a F) no 


with 


3j 


for the one in Fig. 12.10b. Therefore, a mixer driven by an input tone 
x(t) = ef Certo (12.42) 


produces an output tone at w; for every value of n for which a, 4 0. When the 
mixer is part of a receiver designed to down-convert a signal at w7 + w; to w; for 
further processing and detection, the spurious responses (n 4 —1) are undesired as 
they could cause an interfering signal to overlap in frequency with the desired signal 
and prevent reception of the latter. The spurious responses are most often suppressed 
by preceding the mixer with a suitable filter. However, in some situations such a 
filter is undesired. In the following we present a method to suppress the dominant 
spurious responses of a mixer without the need for filters and still using rectangular 
waveforms as local oscillator signals. 

Note that, while the idealised local oscillator waveforms shown in Figs. 12.10a 
and 12.10b are discontinuous, their Fourier series representations truncated at an 
arbitrarily high value of |n| are indefinitely differentiable functions. Suitably trun- 
cated Fourier series are adequate representations of practical signals and do not cause 
any mathematical difficulty. 

Consider the generic N-path receiver shown in Fig. 12.11. It is composed by 
N subsystems that are equal apart from the fact that the local oscillator signal of 
path k, k =0,..., N — 1 is delayed by 7 k/N with respect to path 0. The blocks 
preceding the output signals yọ represent LTI subsystem with impulse response A. 
Let the input signal be as in (12.42). Then, due to the tone at —nwy in the Fourier 
series of /, the kth output signal includes a tone at w; given by 


Vion = h(t) * [ane CTIE (E) x; e7 MOT FON! 


= [a H (Joe Je" K 
2m 

= yo,—n (P) X 

with H the Laplace transform of h. This shows that the output components of interest 


(at @,) are the product of the signal yo,—n and the constants est y »kK=0,...,N—1. 
By exploiting the properties of trigonometric functions we can form weighted sums 
of the outputs yg such that the resulting tone at wı vanishes for some values of n 
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N-l N-1 

k 

Z-n(t) = ) WkYk,—n(t) = Yo,-n (t) ` weer 
k=0 k=0 


Note that the sum on the right-hand side corresponds to a discrete Fourier transform 
of the weighting coefficients. For example, by choosing 


ie COs (=x) (12.43) 


we obtain 
N-1 


20 x 
zonlt) = yont) X cos (Zr) 
= N 
-1 
— Yo,—n(t) Yel Fare + el ¥0-Dk k 


2 k=0 


The sums are geometric series that evaluate to 


2: 2: 
el wind 1—e/ WD 


N otherwise 


= _as2a(ntl 
ey a Lijk pirm =e n+1A4ANm,meZ 


> 


and therefore the signal z_, is 


' n#Nmx+1 

Z—n(t) = N y 

z Y0,—n (t) otherwise. 

For example, for N = 8 all harmonics below the 15th except for the 7th and the 9th 
are suppressed. A mixer with no spurious responses at some odd harmonics is called 
a harmonic-reject mixer. 

The weighting factors of (12.43) are not the only possible choice. For example, 
any rotation of the indexes Wk+m) mod y produces a similar result with the addition of 
a phase factor to the output signal. For N even we can thus construct a full quadrature 
demodulator by building two weighted sums, one with weighting factors as given 
by (12.43) and the other by factors rotated by N/2 (wg+n;/2; see Fig. 12.12). The 
case with N = 4 corresponds to classical situation with differential output signals. 
Further choices of weighting factors allow isolating responses at values of n different 
from 1. 

While we discussed summing the signals after the LTI systems characterised by h, 
the same results apply if the signals are summed right after the mixers. The rejection 
obtained in practice is limited by mismatch between the paths. The place where the 
summation is implemented plays a role in this respect. 

If we revert the direction of the signals in the system of Fig. 12.11 we obtain an 
N-path transmitter. This is a generalisation of the classic case with N = 4 with the 
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4 input signals being differential versions of the r and q modulator input signals. 
As with the receiver, a larger value of N allow suppressing spurious emissions at 
harmonics of the local oscillator signal without the use of filters. 


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, 
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate 
credit to the original author(s) and the source, provide a link to the Creative Commons license and 
indicate if changes were made. 

The images or other third party material in this chapter are included in the chapter’s Creative 
Commons license, unless indicated otherwise in a credit line to the material. If material is not 
included in the chapter’s Creative Commons license and your intended use is not permitted by 
statutory regulation or exceeds the permitted use, you will need to obtain permission directly from 
the copyright holder. 


Chapter 13 A) 
Weakly Nonlinear Time-Varying Systems |as 


The theory of linear time-varying systems can be extended to weakly-nonlinear 
time-varying (WNTV) systems in a similar way as we did for linear time-invariant 
systems. In this chapter we first define WNTV systems mathematically and high- 
light some important differences from the theory of WNTI systems. We then discuss 
weakly-nonlinear periodically time-varying (WNPTV) systems. These type of sys- 
tems generate a characteristic spectrum that is relatively easy to describe and is 
relevant, for example, in the study and design of communication systems. 


13.1 Weakly Nonlinear Time-Varying Systems 


13.1.1 Definition 


A Weakly-nonlinear time-varying system is defined as a system whose response to 
the input signal x can be described by 


oe) 
VO) =Y welt, Tiso., Tk) x XE Gis) (13.1) 
k=1 


The operator x is the extension of the operator introduced in Sect. 12.2.2 to higher 
dimensions. For causal systems described by regular distributions and driven by a 
right sided input x it is defined by 


k 
Well, Ti,- , TKK (T1, 0 Tg) := 
ri t 


fo fein meta) rtrd dey, (13.2) 


0 0 
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wg is the kth order fundamental kernel of the system. As for WNTI systems, to 
guarantee uniqueness, we require it to be symmetric in the variables T1, ..., Tk. 
Generalizations valid for a wider class of input signals can be done in the same 
way as was done for LTV systems. Note that wg » x®* is a distribution of the single 
variable ¢ and not a higher dimensional distribution as for WNTI systems. The reason 
for this is explained next. 
Consider a WNTV system described by a differential equation of the form 


L(t, D)y = N(t, D)x + o(t)y? + 3 (t)y? ++ 
with 


L(t, D) = D” + dm_\(t)D" | + +++ + ao(t) 
N(t, D) = b,(t)D" + b, 1D! +--+ + bolt) 


and where all coefficients a;, b; and c; are indefinitely differentiable functions. The 
equation can be solved iteratively as in the case of WNTI systems. We first solve 
the linear part of the equation. The solution y, is then used in the nonlinear terms 
to compute “nonlinear sources” of second order. With them we solve the part of the 
equation consisting of terms of second order only, a linear equation, and so on. 

There is an important difference compared to the case of WNTI systems: in the 
case of WNTI systems, to get around the lack of a general multiplication between 
arbitrary distributions, we made use of a direct product of distributions and introduced 
a multiplication based on the tensor product. Here the same method doesn’t work as 
the coefficients of the differential equation are functions of the single time variable 
t and it is unclear how to adapt them for use with higher order distributions. For 
this reason here the responses of all orders y% are distributions of the single variable 
t. To solve the equation we must therefore assume the existence of all appearing 
multiplications and powers y”, k = 2,3,.... 

If we consider ¢ as a fix parameter then the multiplication between components 
of y act as a tensor product like operation. Consider the product between yg and y; 


t t 
yet) y(t) = fof wee Ti, o, HX (T1, +> , Hdt1 + ++ dt 
0 0 
t t 
feo fme Ti, 7) x(t, +++, UATI +++ dT 
0 0 


t t 
= fof we. Tyee ey THIWI(E, Trp, -+ -> Tk+l) 
0 0 


; xH n, oe, Teg DOT ++ tea. (13.3) 
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The result has the form of a response of order k + / which can be interpreted as a 
“nonlinear source” generated by nonlinearities and lower order responses as desired. 

To solve the equation we must be able to solve the equation for each order indepen- 
dently and verify that it has the desired form. Solving the equations is (in principle) 


simple as all equations are linear. The solution of the equation consisting of terms of 
order k is given by 


t = T 
x= f f fva TET, Vises vy HOR? (T1, ++, ets de 
0 


with zg * x®* the nonlinear source and v the fundamental kernel of the equation. 
To show that this expression can be transformed in the desire form, consider the 


integral 
t T T 
[[ [rte Ti, T2)dt2dt, dT . 
0 0 0 


As a first step we exchange the order of integration between t and t; and obtain 


t t T 
iff f(T, Ti, T)dt,dtdt, é 
ou 0 


We then perform a second exchange between tz and t (refer to Fig. 13.1) which 


results in 
[I j f(T, UT, T2)dtdt2d7 . 


0 0 max(tı, t2) 


If the integral would involve more integrations between 0 and t then we could repeat 
the last step more times giving 


j- v í F(t, T1, -.., H)ATAT, -- (13.4) 


0 max(t,..., Tk) 
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Fig. 13.1 Domain of A 
integration (refer to text) T2 
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Using this result we can transform the above expression for y(t) into 


t t t 


y(t) = fof v(t, T)Zk(T, T1,..., TAT 


0 0 max(tı,..., Tk) 


x (1, ..., dt, -+ < dT 


which has the desired form w; * x2. 


13.1.2 Time-Varying Nonlinear Impulse Responses 


As for LTV systems, the response of WNTV systems can also be expressed in terms 
of the time-varying nonlinear impulse responses 


hg(t, E1, ..., Ek) = wet, t — &1,...., t — &) (13.5) 


and the convolution operator »*; for time varying systems 


Ag(t, 1,-2.5 Ek) že xO (Ei, -ap &) = 


t t 


foo [htir EE Gide + db. (13.6) 


0 0 
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Example 13.1 
Consider a WNTV system described by the following differential equation 
Dy +a(t)hy=x+y’. 


We are interested in the second order fundamental kernel of the system. 
The fundamental kernel of the linearized equation is given by (12.26) which, 
taking into account the commutativity of the product of scalar functions simplifies 


to ; 
wi(t, t1) = e la a(A)da i 


With it the linear response of the system is 
yi(t) = wilt, T1) * x(t). 


Given yı we can compute the “nonlinear source” of second order 
f f 
2 
y(t) = f fme Tti)wı (t, T2)x(t1)x (T2)dtı dt? . 
0 0 


With it we can then solve the equation consisting of terms of second order only 
(D + a(t))y2 = yr. 


The fundamental kernel of this equation is the same as the one of the first order 
equation. The second order response of the system is therefore 


t 


n= f wit, Dodt 


0 
t i, t 
= T] I w(t, T)W (T, T1)wı (T, T2)dT x(Tt1)x (t2)dtıdt2 . 
0 0 max(tı, t) 


The second order fundamental kernel of the system can be found by comparing this 
expression with y2 = w2(t, T1, T2) * x8? (T1, T2) giving 


t 


win. BS I 2 fi a(ada Ji aQ)da Sa Wd ge 


max(T),72) 
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As a check we verify that in the special case in which a(t) is constant we obtain the 
same result as in Example 9.5. Evaluating the integrals gives 


w(t, T) = i _ pee) 


and, after the variable substitutions £; = t — t;, i = 1, 2 we indeed obtain an expres- 
sion equivalent to h2 in Example 9.5. 


13.1.3 Time-Varying Nonlinear Frequency Responses 


Weakly-nonlinear time-varying systems can equivalently be characterised by time- 
varying nonlinear frequency responses. The kth order one is defined as the Fourier 
transform with respect to &,..., & of the impulse response h(t, &),..., Ek). For 
regular distributions 


hy (t, @1,---, Ok) = fof hetig oag (13.7) 


—00 —00 


with œ, é € R4. 
The response of order k of a system can be calculated by 


— sei 
y(t) = Onin wl, a, Op) LOT HOt g FP Cw, nn, we). (13.8) 


The derivation is entirely analogous to the one dimensional case carried out in 
Sect. 12.4.1. 


13.2 Weakly Nonlinear Periodically Time-Varying Systems 


Weakly nonlinear periodically time-varying (WNPTV) systems are weakly nonlin- 
ear systems whose characteristics vary periodically in time. In other words, their 
fundamental kernels, impulse responses and frequency responses are periodic func- 
tions of time and can therefore be expanded in Fourier series. For example, the kth 
order frequency response of a J -periodic system can be represented by the series 


[e0] 
Aag(t, o, OK) = J hen (Or, ogee" 


n=— 00 
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Fig. 13.2 Generic representation of a WNPTV system 


with wr = 27/T. This representation highlights the fact that such systems can be 
represented by a parallel connection of a countable set of weakly nonlinear time- 
invariant networks whose outputs are shifted in frequency by a multiple of wy 
(see Fig. 13.2). Practical applications where this representation is particularly useful 
include the analysis and design of communication systems. 

In the rest of this section we focus on the special case in which weakly nonlinear 
periodically time-varying systems are driven by a set of tones. This will reveal a 
spectrum characteristic of this type of systems. 


13.2.1 Discrete Convolution 


Before turning to actually calculating the response of WNPTV systems driven by 
a set of tones, it’s convenient to introduce some notation that will simplify many 


expressions. 
[e6] 
A series J` a, is absolutely convergent if the sum of the absolute values of the 
n=—OoOo 
terms converges 
[e6] 
> lan| < co. 
n=— 00 


In this case the value of the series doesn’t depend on the order of the elements. 
The product of two absolutely convergent series ) ae an and are bn is also 
absolutely convergent 
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CO CO [0,6] oe) 
| > a X. bls > lal DS Iba] <œ 


n=— 00 n=—00 n=— 00 n=—00 


and can be expressed as 


loo) (oe) 


`y an 2 b, = 5 ( 5 dibi): 


n=—00 n=—00 n=—00 q=—00 


The inner sum in the last expression is called discrete convolution (or Cauchy prod- 
uct). For convenience, we are going to denote it by 


(oe) 


(a, *d b)n := > Aghn—q (13.9) 


q=—00 


The discrete convolution is associative and commutative 


(a, xa b.) *a c.) = (a. *a (b. *a c.)), 
(a, Xq b)n = (b. Xq a)n 


and has a unit element, the Kronecker delta 


ôn = (13.10) 


13.2.2 Product of Fourier Series 


In the following we use the convention introduced in Sect. 4.5 of denoting the kth 
Fourier coefficient of a distribution f by c( f). 

It is well known that if t œ> f(t) is a continuous 7 -periodic function, its Fourier 
series is absolutely convergent for all values of t [23]. If f and g are two such functions 
then their product is well-defined and continuous. In addition, the Fourier coefficients 
of the product can be expressed in terms of the coefficients of the individual series 


[0,9] [0,9] [oe] (oe) 


2 ae D | en(gyemr' =) | ( Sa cals) Je. 


n=—CO n=— 00 n=—C q=- 


The coefficients of the product are evidently the convolution product of the coeffi- 
cients of the two series 
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[oe] 


(Afjxe@)) = >) el) cn- l8) (13.11) 


q=—00 


Let now f and g be two T -periodic distributions. Let further introduce the 
sequences (fx) and (g) defined by 


fe=f*Be and gk=8* fk 


with (6%) a sequence of functions in D converging to ô (for example the sequence 
of Example 2.5). Then (fg) and (g) are sequences of indefinitely differentiable 
functions converging as distributions to f and g respectively. The Fourier series of 
each member of each sequence is thus absolutely convergent. 

If the product f g exists, then it defines a 7 -periodic distribution which must 
coincide with the limit of the sequence 


fg = lim fi ge. 
k—> oœ 


The Fourier series of each member of the sequence can be written as 


[oe] 


Si 8k = 5 (Efe) *a ¢.(ex)) e7 . 


n=—OOoO 


Therefore, from the assumption of convergence and the uniqueness of the Fourier 
series representation of periodic distributions we conclude that the Fourier coeffi- 
cients of f g must be 


en f 8) = (6.(f) *a €.(8)), = lim (c. (fi) *a €.(80)),,- 


Example 13.2 


Consider the regular 7 -periodic distribution shown in Fig. 13.3 that we denote by f 
and whose Fourier coefficients are 


0 n even 
ne 2(-1)"7 nodd. 
From the graph it’s apparent that the product of f with itself is well-defined and 
produces the regular distribution with constant value 1. The Fourier coefficients are 
evidently all zero apart from the zeroth one whose value is one co(f f) = 1. We 
show that, despite the fact that the Fourier series of f is not absolutely convergent, 
c.(f) *a c. (f) produces the right answer. 
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Fig. 13.3 Square regular 
T -periodic distribution 
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First note that for n odd either cg(f) or Cn-q4( f) is zero for every value of q 
Hence, 


(c.(f) *ac(f)), = 0 


n odd. 
For n even the convolution product is 
[oe 
2 2 n=2Uk+1) 
= 1) n=20+1) 
(c.f) a eP), as xk + D. ) a(n — (2k + pS 


= con) D ; 


 QOk+ Dn k+) 


(oe) 


a 


The zeroth coefficient is therefore 


For the particular case n = 0 the summation in the last expression can be written as 


00 2 
TT 
«(2k + ES 2 ai 1)? 4 


2 
COECOI OK 


4 
To evaluate the Fourier coefficient for n Æ 0 it’s convenient to rewrite the sum- 
mation as 
[o0] [o6] 
1 l/n l/n 
D = Ps, / ' 
ae (2k + 1)(n — (2k + 1)) Ae 2k+1 (n—(2k+4+1)) 


In this form it’s apparent that for each value of n all terms cancel in pair (the kth with 
the (n/2 + k)th), thus giving 


(c.(f) *ae(f)), = n # Oeven. 
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13.2.3 Response to Multi-tones 


Consider a weakly nonlinear periodically time-varying system described by the dif- 
ferential equation 


L(t, D)y = Nit, D)x + ot) y? + t) y? +- 
with 


L(t, D) = D” + am-1 6) D”! + +++ + ao(t) 
N(t, D) = b (t) D" + byi D"! +--+ bolt) 


and where all coefficients a;, b; and c; are smooth 7 -periodic functions. We assume 
that the system is driven by N complex tones 


x(t) = Ae p -o + Aye 


with A,,..., Ay the phasors of the tones. 
In Sect. 12.4.2 we saw that the solution of the linear part of the equation is given 
by 


N 
yit) = So Anhi(t, one!" 


n=1 


with hy the (first order) time-varying frequency response of the system. We also saw 
(Sect. 12.5.2) that t +> A(t, wı) is a J -periodic function. Expanding it in a Fourier 
series, yı can be written as 


N oo 
yi(t) = >> Apel Y figl). 


n=l q=—00 


yı is therefore a sum of tones at qær + œn. 

We now solve the nonlinear equation by adding terms to y; in a similar way as 
we did for weakly nonlinear time invariant systems in Sect. 9.5. As explained in 
Sect. 13.1 here we must assume the existence of the powers yk, k =2,3,... and the 
others that will appear below. 

For the sake of solving the equation let’s assume that the frequencies wy, @1,..., 
on are all incommensurate. Under this assumption, the only power resulting in terms 
proportional to A j Aye! @iF ev"; j,/=1,..., N is the second order one 


2! 
2 — mı MN „]Ømt 
co(t t)= —A,'.---A,e 
O=), Ai N 
|m|=2 
[o,e) 
$ (c.(c2) a fi, Oy ea a hi, Cony’) 01907" 


q=-00 
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with m the multi-index m = (m,..., my) whose elements range from 0 to k (=2) 
and œm as defined in (9.27) and repeated here for convenience 


N 
On = J Mn On = M0, H: + Myon. 


n=1 


Similarly to the time invariant case we can assume that the solution of the 
nonlinear differential equation includes a term of second order y2 proportional to 
A; Aje? @itent, j,l=1,..., N. y2 can be found by retaining only those terms in 
the equation that are proportional to A ; A;e/ (+0! The resulting equation is linear 
with c(t) y2(t) playing the role of a source composed by tones. Exploiting linearit 

yi) playing P y p 8 y 
we can solve the equation for a single tone at qær + wı + w2 and combine the results 
at the end 

L(t D) êz q(t o wy )e) Tto to) = el Gortator)t 


th> 8, g(t, @1, @2) is also J -periodic and can be expanded in a Fourier series 


or) 
a a +q)o7 ++, )t 
82.9 (t, Œ, œz )e! THH) = a ĉj; Gae" qQ2)or+o, o) ; 


q2=— 00 


With it the second order term yz is given by 


2! 
yo(t) = DY Ap! «+ Apre 


|m|=2° 
[e0] 
DS (6.2) a ĥi, (oi) xa + a hi, (ony), 07" 
qı=—00 
[o0] 
5 Cq l2 qg 0120"! 
q2=—00 


which, with the change of variable / = qı + q2, can be rewritten as 


2! cr 
y2(t) = 5 LAT o ANN 5 Âz m pel orton 
|m|=2 uo l=—oo 
with 


CO 


fiama c= DD (c.(c2) *a hi, (01) eg ka EO S (82,91) - 


qı=—00 
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The second order response of the system thus consists of tones at all possible sums 
of two of the input tone frequencies at a time, around each of the harmonics of the 
fundamental frequency of the system. 

The higher order responses can be calculated in a similar manner. The kth order 
response has the form 


k! m m > i orto 
y(t) = > „14l 1 ee ANN > firm 1e”! 7 +Om dt (13.12) 


|m|=k g l=—oo 


and is composed by tones at all possible sums of k input tone frequencies at a time, 
around each of the harmonics of the system fundamental frequency. A comparison 
of the typical two tones response of LTI-. WNTI-, LPTV- and WNPTV-systems is 
shown in Fig. 13.4. 
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Fig. 13.4 Comparison of typical two (real) tones spectral response of LTI-, LPTV-, WNTI- and 
WNPTV-systems 
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Note that the factor 


oo 
) fik m 187 


l=—00 


appearing in the kth order response yx is the Fourier series of the time-varying kth 


order nonlinear frequency response of the system h k(t, @1,..., yg). It is related to 
hkm by 
hkm. = c(hg(t, %1, aE w1, t ON, ated ,@n)) ’ |m| = k. 
— —<— — a 
mı my 
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Chapter 14 A) 
Periodically Switched Circuits get 


This chapter is devoted to illustrating applications of the theory of weakly nonlinear 
time-varying systems with practical examples. After introducing the class of electri- 
cal networks called switched circuits, we analyse in details some instantiations. We 
analyse a practical implementation of a quadrature modulator, including its distortion 
and other aspects of fundamental importance in applications. As another example of 
the usefulness of time-varying circuits, we illustrate how they allow implementing 
highly selective filters that are otherwise unfeasible in small integrated form. 


14.1 Switched Circuits 


Animportant class of circuits that finds many applications is the Switched circuits one. 
These are circuits whose only time varying components are switches. Despite the 
fact that switches are time-varying resistors, this class of systems can be analysed as 
a sequence of time invariant circuits, each one valid over an interval over which all 
switches remain in the same state. 

Let O denote an open interval of R” . The space Do of test functions @ with support 
contained in O is a vector subspace of D. A distribution on O is a distribution defined 
on Do. The vector space of distributions defined on O is denoted by D'o. 

Consider a linear switched circuit including at least a capacitor or an inductor. 
Without loss of generality we can assume the network to be driven by a single 
independent source x (superposition principle). Let t;,i € N denote the times at 
which any of the ideal switches changes state and O; the open intervals (¢;, t;+1). 
In any of these intervals the network is described by a system of first order linear 
differential equations 

Du = Aju + Bix 
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with u the state of the network that we can represent by the voltage across the 
capacitors and the currents through the inductors. Suppose that the network is in the 
zero state and that we apply a Dirac impulse at time t with t; < T < ti+1. Then, 
for tT < t < t;+, the state u evolves as a continuous function. At time t;+ı some of 
the switches change state. If we exclude circuits including closed loops composed 
exclusively by ideal inductors, ideal voltage sources and (closed) ideal switches, then 
at this time one of the following happens 


e If an ideal switch across a capacitor is closed, then the voltage across that capac- 
itor immediately after t;,; becomes zero: vc(tj41+) = 0. Since charge must 
be conserved, this change in state must be accompanied by a current impulse 
Uc (ti41—-) CS (t — ti+1) discharging the capacitor through the switch. 

e If the closing of a switch forms a closed loop formed exclusively of (ideal) capac- 
itors then at time #;,, the charge in the capacitors will instantly redistribute under 
the constraint of charge and energy conservation. This will be accompanied by cur- 
rent impulses through the capacitors. For example, if at t;}ı two capacitors with 
capacitance Cı and C2 respectively are connected in parallel by an ideal switch 
then the voltage across the capacitors immediately after the closing of the switch 
will be 


viti HIC + Co] = v2(ti41 FIC) + C2] = v1 (ti41—-) C1 + v2 (ti41-) C2 « 


e If an ideal switch in series with an inductor is opened then the current through 
the inductor immediately after ¢;,,; becomes zero: i, (t;4;+) = 0. Faraday’s 
law implies that this change in state is accompanied by a voltage impulse 

ir (tj41—)Ld(t — t)41) across the inductor. 

e In all other cases the voltages across the capacitors and the currents through the 
inductors remain unchanged. In other words the state component um representing 
any of those quantities immediately after t;}ı must equal the state component 
before that time instant: um (tiy1 +) = Um(ti41—). 


These conditions specify initial conditions for the interval Oj+; that, together with 
the differential equation, allow to calculate the evolution of the state u of the network 
in that interval. The same arguments apply to all subsequent switching times. The 
state u is therefore fully determined and can be extended to a distribution on the hole 
of R. The fundamental kernel W is therefore well defined for t € O;,i € N. 

The only problematic cases are when t coincides with one of the switching times. 
To work around this problem we limit the set of allowed input signals to the set of 
regular bounded distributions. Then, assuming further x to be right-sided, the state 
of the circuit can be represented by the integral 


u(t) = J W(t, T)B(t)x(t)dt. 
0 
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Fig. 14.1 Voltage-mode vgo(t) O 
quadrature modulator elf 
Osu) oY 
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vr (t) = —vr(t) 
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Since the values of t at which W is not defined is a set of zero measure, the output 
is well-defined at all times. 

The above discussion shows that for switched circuits the computation of the 
time-varying impulse response is a straightforward process. In addition, since the 
impulse response in each interval O; corresponds to the one of an LTI system, the 
computation of the time varying frequency response by Fourier transformation of 
h(t, €) does not pose any problem. 

Linear periodically switched circuits are linear switched circuits in which the 
operation of the switches is periodic. For these circuits the time-varying impulse 
response h(t, &) and the time-varying frequency response A(t, @) are periodic in 
time. 

In the following we illustrate the use of this technique to analyse idealised versions 
of some practical periodically switched circuits used in communication receivers and 
transmitters. 


14.2 Voltage-Mode Quadrature Modulator 


In this section we analyse an implementation of the quadrature modulator of Exam- 
ple 12.9 suitable for realisation in a CMOS technology and shown in Fig. 14.1. The 
input signals v; and vg (called r and q in Example 12.9) are applied differentially. 
We assume the LO signals to be non-overlapping and to have very fast edges. In fact 
we model the gate signals as having rectangular waveform high 25% of the time and 
with a relative delay among them of T /4. We assume further that when the corre- 
sponding LO signal is high each MOSFET can be modeled as a resistor of value 
ron, While when the LO signal is low it can be modeled as an open circuit (infinite 
resistance). We are interested in the signal v4 at the input of the amplifier following 
the switching transistors and assume that the input impedance of the latter can be 
adequately modeled as a capacitor. 
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Fig. 14.2 Voltage-mode quadrature modulator model 


Under these assumptions the circuit is linear. Hence, we can analyse the contri- 
bution to the output of each input signal independently. Figure 14.2 shows the model 
used to analyse the contribution of signal v} where we have combined roy with the 
source resistance (assumed equal at all inputs) 


r =ron + Rs 


and where we assume the switches to be closed when the corresponding LO control 
signal /;, i = 0,..., 3 is high and open when low. The control signals are defined by 


L(@):=lt-iT/4) i=0,...,3 


with / the signal introduced in Example 12.10 and shown in Fig. 12.10a with t = 
T /4. 


14.2.1 Single Input Response 


In this subsection we analyse the response to v}. To compute its contribution to the 
output, we apply an input impulse at time t. If the impulse is applied when the input 
switch is open, then its contribution is zero. If it’s applied when the switch is closed, 
t € (-7 /8,7 /8) mod 7, its contribution is described by the differential equation 


(l+rCD)v, = d(t —T). 
Note that the capacitor C is connected in parallel with a resistor of value r at all 
times! The fundamental kernel is therefore 


1 
W(t, T) = wag EO 1,(t-—T)lo(t), — @xan = € 
7 


and can be interpreted as the impulse response of an LTI system whose input signal 
io pt 
is v7 (t)lo(t). 
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The time-varying impulse response of the system can be obtained from the above 
fundamental kernel by applying the variable transformation € = t — t 


A(t, E) = asap ee °™** 14 (E) lolt — £) . 


We compute the response of the system to a complex tone through the time- 
varying frequency response. The latter is obtained by Fourier transforming h(t, £) 


with respect to € 
[o0] 


h(t, w) = / h(t, €)e 1% dé. 
—0o 
Using the Fourier series of lo 
00 1 
1 =0 
lo(t) = ane” T , an =an = 44 á 
o(t) 2 fe sin(n4) n>0 


where wr = 2x /T , we have 


n=—oC 


[o6] 
[o0] 
ht, w) = dB f e7 rat Jove apert) dé 
0 


(oe) 


[0,6] 
= @3dB 5 a dé e/"7! 


n=—oCo 0 


5 fn lo) e” (14.1) 


n=—oCo 


with 


[o0] 
hy(w) = anasan | Snie dé 
0 


an 


= ——____. 14.2 
1 + J @tnor ( ) 
©3dB 
The response of the system to a complex tone of angular frequency w is thus 
[o0] 
va(t) = > hy(w) ef OTO (14.3) 


n=— 00 


and, as remarked above, is seen to be equal the response of an LTI system with 
transfer function 
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(14.4) 


to the input 


CO 
FOLA = Sz ae erent, 


n=—C 


14.2.2 Input Current and Switched Resistor 


Before combining the outputs from the four input signals v7, v7, vå and vg we 


compute the input current drawn from the input vp when the other sources are 
disabled. When the input switch is closed, the input current is equal to the current 
flowing into the capacitor, while when the switch is open, the current is zero 


is(t) =In@ic@, ict) = CDi} (9). 


The current is therefore given by 


oo 
X j ape!”®T! 


n=—C 


o0 (w+nor) 
an J @3dB el (w+nowr)t 


E o+nor 
n=—0o n I+) 3dB 


is(t) 


(w+nor) 


oo o0 
3 y Anan J @3dB e/lot@tnjer It 
r 1+ j 


M>=—OO N=—OCO ©3dB 


(oe) 


XO yor (14.5) 


k=—0o 


with sa 
J(@ +nor)C 
yko) = J akna, —$— r (14.6) 
pem Bae hares 


where in the last step we made the substitution k = m + n. 
Let’s consider more closely the term for k = 0. Substituting the expression for a, 
we obtain 


3 sin’ (n=) J(@ + nwr)C 


2; o+nor 
n#0 (mn) I+) @3aB 


( ) 6 
w) = Jæ + 
Yo J I 


To find an approximate value for this series it’s useful to separate real- and imaginary- 
parts 


yo(@) = go(w) + jbo). 


14.2 Voltage-Mode Quadrature Modulator 321 
We start by simplifying the imaginary part 

ay) 
C y sin“(n7) (w+ nwr)C 


2 2 
16 n0 (xn) 1+ (2) 


@3dB 


bo(w) = w 


The first thing to note is that the terms proportional to nw; with n > 0 cancel with 
the ones for n < 0 so that we obtain 


1 sin? (n=) 1 
bo(w) = c| +5 4 | À (14.7) 
ee ia) 


All terms in the square bracket are positive. If we assume |w| K œr < wzag the 
terms decrease as 1/n? forn < wr /wzag and as 1/n* for larger values of n. We can 
therefore bound the series by 


sin? (nZ) 1 1 
>. - 7 >, Genk 


2 
n#0 (an) 1+ (=) n#0 


Using the known result 


we thus obtain the upper bound 


1 2 1 1 
bo(@) a 3 9 


< a iu mm na 
oC 16 Gn)? 48 


n=l 


This bound is tighter for large values of wag /Æ@r. 
To obtain a value closer to the actual value of the series we note that for N > 1 


D sitat) x a (14.8) 


Instead of bounding sin? (nz /4) by 1, we approximate its value by 1/2 independently 
of n to obtain 


bey 1 1 11 
= x 0.23. 
oC “667Z 


Figure 14.3 shows the normalized value of bo as a function of 343 /œr computed 
from (14.7). We see that, while the argument to obtain the above approximate value 


322 14 Periodically Switched Circuits 


Fig. 14.3 Normalized 
values of the real- and 
imaginary parts of 
yo(0.1@7-) as a function of 
the ratio w3gB /@r 


is questionable, for large values of wag /@7 the approximation is remarkably close 
to the real value. 
We next turn to the real part of yo(w) 


80(@) = 5 a) (rs 


(14.9) 
2 2 
r n20 (an) 1+ (eer) 


If we assume |w| K wz < w3zag then to a good approximation we have 


2, 
2 3 sin?(n®) (=) 


80(@) x 2 2 
F n=1 m 1 + (2) 


If wr /wzag K 1 then the quadratic term in the denominator can be neglected in 
a large number of terms up to approximately N = ia: The first N terms of 
the series contribute the largest part of its total value. Therefore, referring again 
to the approximation (14.8), we approximate again sin? (nz /4) by 1/2. Instead of 
neglecting the terms for n > N we approximate the series by the integral 


CO 
1 or 1 


mr @zag J 1 +x? 
0 


with 
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Fig. 14.4 Normalized is (t) 
for a cosinusoidal input with 
o=0.lor, wag = 207 
computed using (14.5) 
truncated at |k| = 60 and 

|n| = 200 


ig(t) / (C/T) 


This integral is easily solved and we finally obtain 


1 or T C 


MAN ones 2 T 


Figure 14.3 shows the normalized value of go as a function of wag /œr computed 
from Eq. (14.9). For wag /ær > 3it’sin very good agreement with the given approx- 
imation. 

Figure 14.4 shows the normalized current is(t) for a cosinusoidal input with 
w = 0.lær and wzag = 2w7. The curve consists of peaks in concomitance with 
the closing instants of switch 0, followed by an exponential decay with a time con- 
stant of approximately | /@34, and a sudden jump to zero at the instants where switch 
0 is opened. (The oscillations around the instants where switch 0 changes state are 
due to the Gibbs phenomenon of the Fourier series.) As the time constant is short- 
ened by reducing the value of r, the curve converges to a series of Dirac pulses at the 
closing instants of switch 0. If we shift the closing instants of switch 0 at multiples 
of T we can express this behavior by 


T aC 
: r pee) ae ~“ aJkørt 
lim is (: 8 ) > Fe ; 


w—>0 k=—o0 


The discrete spectrum of is(t — T /8) for wzag = 3wr, 20wr and w = 0.lw7 is 
shown in Fig. 14.5. The figure shows that as the value of w34g/æœr is increased, 
an increasing number of coefficients y,(w) tend to approach the value of C/T as 
expected. At a value of k © wz3ag/æœr the real and imaginary parts have roughly the 
same value and for larger values of k the magnitude of y;,(@) decreases. 
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Fig. 14.5 Normalized real- 
and imaginary-part of 

ye (we J @+kor)T/8 for 

w = 0.læy as a function of 
k computed with (14.6) 
truncated to |n| = 2000. The 
points between the discrete 


values of k were joined to 5 
better highlight the trend =` 
> 
gy / 
bp / 
l gk (C/T), @gagftoy = 20 
l i bk/ (C/T), @gqp/@r = 20 l 
4 
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k 


14.2.3 Full Response 


We now go back to the voltage across the capacitor v4 and calculate the combined 
response to all four signals v7, UL, vp and vo. To distinguish the four responses we 
will add a subscript equal to the one of the corresponding LO signal. Thus in the 
following we will denote the response to the signal v} given in (14.3) by v4,o. The 
contribution of v7 differs from the one of v} by (i) a shift by T /2 in the LO waveform 
and (ii) a reversal of sign of the input signal. Its contribution to v4 is therefore 


io) 
va.2(t) — gt > hy (@) elnor(t-F /2) 


n=—Co 


o0 
>. (CDH Alw) eJ @tnor)t f 


n=—OO 


Note that the even harmonics have opposite sign compared to the ones of v4.9, while 
the odd ones have the same sign. Therefore the combined response of v} and v7 
consists of odd harmonics only 


vao(t) + vat) = 2 5 h,(w) e7 (otnor)t | 
n odd 


The response to the signal vó differs from the one to vf by (i) a shift by —T /4 
of the LO signal and (ii) a shift of 7 /4 in the input signal 


aes jot 
Yo = Je i 


Its contribution to v4 is therefore 
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œœ 
va3(t) = — je” 5 În (w) enor (t+T /4) 


n=—CO 


o0 
2 5 Po hh (w) eJ @tnor)t . 


n=— 00 


Similarly, the response to the signal vo differs from the one to vp by (i) a shift by 
T /4 of the LO signal and (ii) a shift of —7 /4 in the input signal 


oo 
va) = jel” > hin (œ) eer -T/4 


n=—00 


[0,0] 
2 5 CAO e/ @tner)t . 


n=— 00 


Again we note that odd harmonics of v4 3 and v4,ı have the same sign, while even 
ones have opposite sign. The combined response of these two signals is therefore 
also composed of odd harmonics only 


vas O + va O = —2 Yo" hay (w) ef Oen! 
n odd 
= =) 5 (=1)"+D/2f, (w) e/ (@tner)t i 
n odd 


We now combine the two partial sums v4.9 + va,2 and v4.3 + va,1. The terms 
for n = 1+ 4m,m € Z have the same sign, while the terms at n = —1 + 4m have 
opposite sign. The total sum is therefore 


va(t) = va olt) + va2(t) + v43(0) + va, (t) 


oo 
D 1+4: 
=4 > hitam (a) ellor( Terop] : 


m=— 00 


Note again that the response of the system is equal to the one of an LTI system with 
the transfer function given by (14.4) and driven by the input signal 


x(t) =F Mlo(t) +r OHO EOLO HOLA. 


Using four signal paths (two differential) this quadrature modulator cancels three 
spurious emission tones every four. It’s a transmitter implementation of the harmonic- 
reject mixer presented in Example 12.10 where we showed that to suppress more 
harmonics requires a larger number of signal paths. 

The above derivation of the output signal highlights the fact that suppression of 
harmonics relies on exact cancelling of strong tones. We will investigate some imper- 
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fections limiting the amount of cancelling achievable in practical implementations in 


later sections. Before turning to that question we investigate the effect called carrier 
leakage. 


14.2.4 Carrier Leakage 


Carrier leakage refers to the presence of a tone at tw, in the output spectrum of 
the modulator. In transmitters, it is one of the undesired tones close to the signal of 
interest (or, depending on the architecture, indeed overlapping with the modulated 
wanted signal) that can’t be easily filtered. It is caused by the presence of small, DC 
offset voltages at the inputs of the modulator. These offset voltages are the result of 
mismatch in the driving circuits and are therefore Gaussian random variables. 

We denote the DC offset random variables by Xg, k = 0,..., 3 where the index 
matches the one of the corresponding switch. They form the following signal at the 
input of the equivalent LTI system 


Xolo(t) + Xi h(t) + Xbt) + X; h(t) 
CO 
= 2 (Xo + Xje™ 3” + Xe 1" + X3e7) 2 ")a,e"r" 
n=—0O 
CO 
= So (Xo +X + XD" + X3(j)" ane!" (14.10) 


n=—CO 


Since usually the most problematic tone is the one at +w 7 we only consider the 
terms for n = —1, 1. The term for n = 1 is 


[Xo — X2 + J (X3 — X) are?" 
and the one at n = —1 is its conjugate complex. The sum of the two terms gives 
2a;[X-cos(w7t) — Xs sin(wrt)|,  Xe= Xo- X2, Xs = X3- Xı. 
Linear combinations of independent Gaussian random variables are Gaussian. There- 
fore, if we assume Xx, k = 0, ..., 3 to be independent of each other, X. and X, are 
independent Gaussian random variables as well. We denote the standard deviation 
of X, and X, by ox. Their joint probability density function (PDF) is 


MEAE 


7 e 2oy 5 
2mo% 


PX.,X; (Xe, Xs) = Px, (Xe) Px, (xs) E 
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It is now convenient to pass to polar random variables. Specifically, using the 
relation 
cos(wt + ¢) = cos(@) cos(wt) — sin(@) sin (wt) 


the sum of the input terms for n = 1 and -1 can be rewritten as 
2aı X, cos(wrt + Xe) 


with the new polar random variables 


X, =, X2 + X? 


Xs 
X¢ = arctan ~>. 
Xe 


Given that the probability density in terms of X<, Xs must agree with the one in terms 
of X,, Xg, we must have 


PX.,X, (Xe, Xs )dx dx; = PX,,X4 (Xr, xg )dx,dxg . 


From this equation and dx,dx, = x,dx,dxg we therefore deduce 


y 
xf 
20 


Xr = 
X., 


X, X (Xr, XG) = 
E 2x02 


This joint probability density function is easily factored 
PX, Xo (Xr, Xp) = Px, (Xr) Px, (XG) 


which implies that X, and X are independent random variables with the following 
probability density functions 


1 -4 
Px, Xr) = -z Xre ž, x, 3 O (14.11) 
Ox 
+ 0 < x < 27 


Px, %o) = (14.12) 


0 otherwise. 


The phase random variable X¥ is uniformly distributed over the full circle. The distri- 
bution of the variable X, is called Rayleigh distribution. Its PDF and complementary 
cumulative density function 1 — Fy, (x,) are plotted in Fig. 14.6. The PDF assumes 
its maximum at x, = oy. The expected value and variance of X, are 
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Fig. 14.6 Rayleigh 
distribution for cy = l.a 
Probability density function. 
b Complementary 
cumulative density function 
1— Fy, (xr) 
Xy 
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[o0] 
T 
E[X,] == Xr PX, (x;)dx, = 30X 
0 
and 
Sy 
Var(X,) = El, — ELX, D] = S03 
respectively. 


The carrier leakage of the modulator is therefore given by 


2 
VAHU Cera 


X;—? 
T 


with the phase uniformly distributed over the full circle, The magnitude of the tone 
is Rayleigh distributed and if wr « w3ag has an expected value of 
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Ox 


Jt 


From Fig. 14.6 we read that, under the same assumption, 0.1% of the modulators 
have a carrier leakage magnitude exceeding 


2 
3.7 —oy © 1.67ox. 
IT. 


14.2.5 Image-Rejection 


In this subsection we come back to the finite cancelling of harmonics in practical 
implementations. We assume again the common case of a transmitter up-converting 
the input signal to w + wr with |w| K wr S wrap. 

In previous calculations we assumed perfectly balanced signals v; = —v;, vo = 
—v5; equal amplitudes for all signals, a phase difference between v; and vo of 
exactly 2/2 and delays between the LO signals of exactly 7 /4. If we now introduce 
small differences in the amplitudes 


vp (t) = (Ar + AAs/2)e!", v7 @) = —(Ar — AAs /2)e" 
the even harmonics of v (t)lo(t) + v7 (h(t) do not cancel perfectly anymore 


ve (lot) + vp Ola) = $ 2A trt + YY Arae orto 


n odd n even 


and similarly for the signal Up (t)l3(t) + val (t) 


OLO + OLHO = — DI "2A gaperorsornn 
n odd 


= pa J'A Agape or tonAg] 


n even 


where in addition we have added a small delay error in /3 and J; of At and set 
Ad = 21 At/T. If we now sum these partial sums, the complete cancelling that 
was happening for three harmonics out of four becomes a partial cancelling. In 
particular this partial cancelling causes the appearance of a tone at |w7 — œ| (for 
n = —1) which is difficult to filter as, in a single-sided representation, it appears 
very close to the wanted signal at wy + œ. The tone at |w7 — @| is called the image 
of the wanted signal. The ratio of the magnitude of the image to the one of the signal 
is called the image-reject ratio (IRR) and is given by 
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Fig. 14.7 Mixer -15 
image-reject ratio 


IRR (dB) 


Al = Age!*¢ 
Ar + Age /4¢ 


_ [(Ar — Ag)? cos?(A@/2) + (Ar + Ag)? sin?(Ag/2) 
~ V (A; + Ag)? cos?(A@/2) + (Ar — Ag)? sin?(A@/2) 


Jé Z =). + tan2(Ag/2) (14.13) 


IRR =| 


Aye /4¢/2 — Age! 4?/? 
> Beare + AgeJ4¢/2 


A; + Ag 


where in the last step we neglected the term (A; — Ao)” sin?(A@ /2) in the denom- 
inator which is of second order in the errors. Note that part of the phase error could 
well come from the input signal. This is the IRR of the effective signal at the input 
of the LTI system H(s). It is plotted in Fig. 14.7. 


14.2.6 Effect of Mismatch 


In this subsection we investigate the effect of mismatch which, as we will see, is 
another phenomenon limiting the amount of harmonic cancelling achievable in prac- 
tical implementations. 

Due to mismatch, each of the four transistors with which the modulator is imple- 
mented (see Figs. 14.1 and 14.2) presents a slightly different roy resistance and 
similarly for the four source resistors. Therefore, the value of the resistance con- 
nected to the capacitor is not independent of time, but is time-varying 


3 


rt) = ork), r€ Rk =0,...,3. 
k=0 
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Fig. 14.8 Possible sample wzag(t) A 
waveform of the time 
varying cut-off frequency or 
w3dB(t). Variation greatly ee wo 
exaggerated for illustration T 


The differential equation describing the system therefore becomes 

1 
r(t)C 
x(t) = vp H bA +07 Oh) + vgObO HOLA 


[D + wag (t)]va = @zap()x(t),  @3ag(t) = 


where we denoted the sum of the input signals by x. Since the variation of the 
resistance from the nominal value is small, we can solve the equation using the 
perturbation method and proceed as in Example 12.8 (Fig. 14.8). 

As a first step we develop œw3ag(t) in a Fourier series and decompose it in two 
parts 


zag (t) = we (t) + ws (t) (14.14) 
[0,6] 
welt) = ro lolt) + r2 (£) = @e.0 + Xe X wn cos(nort) (14.15) 
n=1 
[0,6] 
ost) =n hO) +r BE) = oso — Xs X. w sin(nwrt) (14.16) 
n=1 
with 
4 . T 
Wn = — sin(n—), n>0O. 
mn 4 


@-(t) corresponds to the curve of Fig. 12.10b with t = T /4 scaled by X. plus a 
constant term. w; (t) is constructed similarly, but with the curve of Fig. 12.10b shifted 
by —T /4. We use the symbols X, and X, to denote independent Gaussian random 
variables as in Sect. 14.2.4, but they are not related to the quantities of that section. 
We also denote again their standard deviation by ox. 

The sum of the constant terms (which are also random variables) is the average 
frequency 

wo = Wc,0 + @s.0 - 


332 14 Periodically Switched Circuits 


The variable part of wag (t) can be written as 


[0,0] 


> Wr[X- cos(nwrt) — X, sin(nært)] 


n=1 


Proceeding as in the analysis of carrier leakage we can express it in terms of the polar 
random variables X, and X% 


CO 
[> Wn X, COS(nwrt + X¢). 


n=1 


Using this form for w3za4g (t) the differential equation can be written as 


(D + wo)va = wsag(t)x (t) — Y wr X, cos(nort + X)va(t). 


n=1 


We solved this equation to first order in X, for one input tone and one cos term 
in Example 12.8. Referring to that example for details, we conclude that mismatch 
produces tones at all frequencies nw; + w,n € Z. The amplitude of these tones is 
proportional to the random variable X, which is Rayleigh distributed. The phase is 
uniformly distributed over the full circle. 

While in this subsection we focused on mismatch, the same method can be used 
to analyse other effects causing variations in the resistance such as overlapping LO 
signals. 


14.2.7 Second-Order Distortion 


In this subsection we analyse the distortion of second order introduced by the nonlin- 
ear characteristic of the MOSFETs. As discussed, if we neglect mismatch the circuit 
can be modeled as a time-invariant system drived by the input signal 


x(t) = vf DO) +07 OLE) + BOLO + vgOhO. (14.17) 


To simplify the calculations we discard the source resistors Rs and consider the 
situation shown in Fig. 14.9a. We assume the transistor to remain in the so-called 
linear region of its characteristic which is described by 


B 
ip = B(vg — Vr)(up — vs) — zD — vz). 
In our model the gate voltage is assumed to be constant at a sufficiently high level 


Vg, in which case the characteristic can be modeled by the linear resistor r that we 
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b ga? 7 y’) 


1 “lea © =o) 


= 


IH 


Fig. 14.9 a Equivalent WNTI schematic of the quadrature modulator of Fig. 14.1 b Equivalent 
WNTI circuit of the quadrature modulator of Fig. 14.1 


used before and two nonlinear VCCS that we combine in a single one controlled by 
the two voltages vp and vs 


ip = g1(Up — vs) + B2(VH — v3) 


with 
1 B 
g8i=—=B(VG—Vr) and g=—- > 
r 2 
and represented in Fig. 14.9b. Using this transistor model the differential equation 
describing the system is 


(1 +rCD)va =x + Sa — v3). 
1 


The first order response of the system is described by the transfer function H that 
we calculated before, that we repeat here for convenience and to which we add an 
index representing the order as usual 


1 
1S 


@3dB 


Hı (sı) = 


We compute the higher order responses by Laplace transforming the differen- 
tial equation and retaining only terms of the relevant order. To obtain the transfer 
functions directly we use a Dirac pulse as input. The Laplace transformed of the 
second-order part of the differential equation is 


[1 + rC(s1 +s2)]H2(s1, s2) = al — Ay(s1) Ai (s2)]. 
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The second-order transfer function therefore is 


Ay (s), s2) = SHGs +s2)[1 — Hı (s1) Hı (s2)] . (14.18) 


Consider the case in which the modulator is driven by two baseband 
(vy, vr, Up Uo) real tones at w; and œ. The tones of interest in the effective input 
signal x of the WNTI model are at +(w7 + w;), i = 1, 2. Under the assumption that 
|w;| K wr K wzag We can approximate H; at these frequencies by 


AM(jo)~1— Jy 


@3dB 


Using this approximation in H, we find 


g2 1 — (1 — ja) /wzag)(1 — J@2/@3a8) 
w +o: 
gı DEI s 
wito 
x = Tow (14.19) 
2(Ve — Vr) 1 + J% 


@3dB 


H(Jæi, Ja) © 


where in the last step we have neglected the small quantity w1 w2/ or, g- This expres- 
sion shows that, to reduce the principal second order distortion components under 
the given assumptions, it is more convenient to choose a voltage Vg — Vr as large 
as possible than merely reduce r by using a wider transistor. 


14.2.8 Third-Order Distortion 


We next compute the third-order transfer function. The third-order part of the Laplace 
transformed differential equation is 


[1 rC (s1 + 52 + s13 (s1, s2, 53) = —25 [Hi (s1) Hala, 83) ym - 
1 
From it we immediately obtain 
2 
H3 (s1, 52, 53) = -2% p, (s1 + 52 + 83) [H1 (81) Ho (82, 53) loym - (14.20) 
1 


To gain some insight from this expression we assume again two real input tones 
with |w;| K wr K wag. Under these assumptions we can expand H; in a first order 
Taylor polynomial and obtain 
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Fig. 14.10 Quadrature 
modulator IP3 as a function 
of w3qB /@7 normalized to 
|g1/g2|. Solid line computed 


with the full A3 in (14.20), 
dashed line IP3 computed a 
with (14.22). œ = œ = = 
(1 +0.1)or, 03 = = 
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4/82\201 + @2+ @3 
A3(J@ 1, J@2, J@3) © =J (=) 


3\g, ®3dB 
C 
J(@i + @2+ al (14.21) 
3B(VG — Vr)” 


As for second-order distortion we find that it’s more convenient to choose a large 
Vc — Vr than to increase the width of the transistor. Using this expression we can 
estimate the IP3 of the modulator as 


| @3dB = (Vg — Vr) @3dB 
aor or 


The value of this approximation is compared with the value calculated from the full 
H; (14.20) as a function of w3¢g/@7 in Fig. 14.10 for wı = œ = (1+ 0.1)wy and 
w3 = —(1 + 0.2)w 7. The approximation gives a reasonable value from w3gg/@7 = 
2. For values of wag /ær < 1 the IP3 is seen to raise. This is however related to 
the fact that the wanted signals do also experience substantial attenuation compared 
with the case of a large ratio w3y3/or. 

It is important to realize that the effective input signal x of the model includes 
many tones that produce many intermodulation products. In particular, the tones 
at 3wær — wi, i = 1, 2 together with the main tones at œr + œw; produce third-order 
intermodulation products that falls close to the wanted signal and are difficult to 
suppress 


~ E! 
Am3 © — 


82 


(14.22) 


Bær — wi — 2(@7 + wi) = wr — 30; . 


These tones are called third order counter intermodulation products (CIM3). We 
saw in previous paragraphs that many practical imperfections introduce tones at the 
second harmonic of the input signals 2(@ 7 + @;). In this case second-order distortion 
does also produce tones around the signal of interest. In particular the combination 


336 14 Periodically Switched Circuits 


1.0000000 
0.1000000 | : : ; J 


0.0100000 + : i : : J 


ISL (V) 


0.001 0000 OFSET ETFI : ETT EPEITI ER SEAT | RTE EPO E IT PAE PPT ETS : IEEE. EELEE SE EAEAP PEES API sA 
0.0001000 : 


ul i li h 


1000 1500 2000 2500 3000 
f (MHz 


0.0000100 


0 


Fig. 14.11 Single-sided output spectrum of the modulator simulated with accurate transistor models 


50 


2(wr — wi) — (@7 + wi) = wr — 30; 


results in a tone at the CIM3 frequency as does the second-order distortion between 
3wq7 — wi; and 2(wr — wi) 


Bar — wi) — (wr + wi) = wr — 30; . 


Depending on the details of the design, these second order distortion components 
may contribute significantly to the overall CIM3 level of the modulator. 

Figure 14.11 shows part of the output spectrum magnitude obtained by numerical 
simulation of the modulator with accurate transistor models, a load capacitance of 
1 pF, an LO frequency of 1 GHz, and two input tones given by 


v7 (t) = —v, (t) = Acos(@;t) + A cos(œzt) 
vo(t) = —vo(t) = A sin(wıt) + A sin(œ@zt) 


with aie Ör 
A=0.15V, = —, = = 

w] 8 w = wı + 64 

We used 22nm FinFETs modelled with BSIM-CMG compact models [29] with 
technology parameters from [30]. The transistors were sized to have an roy of 20 Q 
at Vg — Vr = 0.5 V. Since the threshold voltage of the transistors is 0.311 V, the LO 
voltage high level was chosen to be 0.811 V, while the low level was set to —0.189 V. 
To avoid overlapping the duration of the high pulses was reduced slightly from the 
nominal value of 7/4 to produce a cross point between successive LO signals ca. 


14.2 Voltage-Mode Quadrature Modulator 337 


1 


0.8 


0.6 


= 04 


0.2 


Fig. 14.12 LO signal waveforms used in the simulation of the modulator 


0.1 V below Vr. The raise- and fall-times were set to 0.125 ns giving an LO transient 
slope K of 8GV/s. The LO signals used in the simulation are shown in Fig. 14.12. 

The spectral component levels obtained by simulation compare favorably with 
our analysis. The expected level of the main tones is 


4 
A—— 70.135 V 
V27 


and is very close to simulated one of 0.131 V. Our choice of parameters is such that 
W3aB/@7 © 7.9. We can therefore estimate the expected second- and third order 
intermodulation products from the approximate transfer functions given by (14.19) 
and (14.22) respectively. The expected level of the second order tone at 2(@7 — @1) 
is estimated to be 


4 \21 
(az) zeU Er 0), (or =o) ~ 2.29 mV 


v27 


The expected IM3 level at wr + 2wı — wo is 


4 \33 
(az) FMU or + 01), or +o), = or +n) x 0.23 mV. 


The simulated values are 5.44 and 0.35 mV respectively, reasonably close to the 
predicted values. 

In our simplified analysis we assumed zero LO signal raise- and fall-times. It is 
interesting to investigate how fast the LO transients have to be before the intermodula- 


338 14 Periodically Switched Circuits 


0.002 r T 
0.0018 + 
0.0016 H 
0.0014 H 
0.0012 + 


0.001 + 


|IM3| (V) 


0.0008 }- 


0.0006 }- 


0.0004 j} 


2 i i i i i i i i 
0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 
fr/K(-) 


0.000: 


Fig. 14.13 Simulated IM3 versus LO transient slope K 


tion products start to deviate significantly from the predicted values. We investigated 
this question by simulation. The IM3 level is plotted as a function of the LO tran- 
sients slope K in Fig. 14.13. For all values of K the crossing point was kept constant. 
Note that the LO waveforms corresponding to the lowest K values are essentially 
triangular with no flat high level region. This simulation suggest that the analysis 
gives reasonable IM3 estimates for values of fr/K < 0.14. 


14.3 Sampling Mixer 


A communication receiver should ideally be able to detect a single signal on a 
channel of a frequency band allocated to the service of interest and completely 
suppress all other signals. Due to limitations in the selectivity of filters and the 
difficulty of implementing tuneable filters, this can only be achieved approximately. 
Virtually all receivers are composed by a fixed highly selective filter (the preselection 
filter) typically implemented with surface- (SAW) or bulk-acoustic wave (BAW) 
technologies suppressing all signals outside the band of interest. This filter is then 
followed by some signal amplification and by a shift of the signal of interest to a 
lower fixed frequency with the help of a mixer. At this lower frequency another 
fixed filter (the channel filter) with a bandwidth corresponding to the bandwidth of 
a channel separates the wanted signal from other signals on adjacent channels. The 
channel of interest is selected by shifting in frequency of the input spectrum in such a 
way that the desired signal falls in the passband of the channel filter. This is done by 
appropriately choosing the frequency of the so-called local oscillator (LO) driving 
the LO port of the mixer. 
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The sampling mixer analysed hereafter is an attempt to remove the preselection 
filter and implement a tuneable filter capable of selecting a single channel only 
using components available in a standard CMOS technology. This is driven by the 
desire for miniaturisation and cost reduction. While this type of circuits have their 
own drawbacks they are a nice example showing some capabilities of time-varying 
systems that can’t be matched by LTI ones. 


14.3.1 Time-Varying Impulse Response 


Consider the highly idealised sampling mixer shown in Fig. 14.14. The input signal is 
represented by the voltage source V, and the nodes labelled Vo, ..., Vy_ represent 
output signals. The ideal switches S,,n = 0,..., N — 1 are driven by the J -periodic 
clock signals ¢,. A switch is closed when the corresponding clock signal is high and 
open when low. We assume that the clock signals are non-overlapping. Since no 
reactive component is present on the source side of the switches the output signals 
can be analysed independently of each other. In the following we assume that each 
clock phase has the same duration so that 


T 
th =n—; n=0,...,N-—1. 
N 


In this case it’s enough to compute the time-varying impulse response of one output 
signal only. The other ones are then obtained by a simple translations in time. We will 
therefore compute the time-varying impulse response corresponding to the output 
Vo that in the following we will denote by y. Similarly, we will denote the source 
signal by x. 


Fig. 14.14 Linear periodic R Vy 

switched capacitor circuit I a aia l 

that can be used as a ġo Z So ¢i Z Sı On-1 Z Sn-1 
sampling mixer or as an V; i © 1—0 Vo |—o V t—O Vn-1 
N-path filter C C TE O 
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The circuit, having two phases, is described by two differential equations. Between 
0 < t < tı, when the switch is closed, it is described by 


1 
D + — y = 
y Moy wox Wo RC 


while for ti < t < T, when the switch is open, by 
Dy =0. 


We start by computing the fundamental kernel of the system W(t, t) which is the 
solution of the differential equation when driven by a Dirac impulse occurring at 
time t. Since the circuit varies periodically in time, it’s enough to compute it for 
O0<t<TJT. 

For 0 < t < tı the output is zero up to time t at which point it will jump to 
1/RC and start to decay exponentially as in an LTI system. At time t, the switch 
is opened, leaving the output capacitor floating. The output voltage will therefore 
remain constant up to time 7 . At time F , since there is a resistor between the capacitor 
and the source, the output will simply start to decrease exponentially again with the 
same time constant 1/RC as during the first part of the response. Continuing this 
process we obtain (see Fig. 14.15) 


0 t<tT 
Wit.) = ape Col) T<t<t 
ara apBAt eK) ep ct ck +t,k>1 
BAK t +kT <t<(k+DT,k>0 


Fig. 14.15 Fundamental 
kernel of the sampling mixer 
fort = 0.17, N = 

4,wT =0.5 


Witt) 
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with 
Ae Bre MOND) 


For ti < t < T, given that the output is disconnected from the input, the output 
remains zero 


W(t,t) =0. 


hera 


The time varying impulse response can be derived from the fundamental kernel 
with the help of the variable substitution £ = t — t and by keeping in mind that the 
value of the impulse response h(t, €) is the value of the output at time t assuming 
that a Dirac impulse was applied € seconds in the past. As the impulse response is 
periodic, it’s enough to compute its value over the first period. For 0 < t < tı it is 
given by (see Fig. 14. 16a) 


Fig. 14.16 a Time-varying a 
impulse response of the 
sampling mixer as a function 


of time for € = 0.1T, 

N = 4, w = 0.57 b 

Time-varying impulse 0.4 H 

response of the sampling EA 

mixer as a function of € for = 034 

t=O0.1T, = 

N = 4, w = 0.57 0.2 L 
0.1 F 


h(t.) 
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ape TE O0<é<t 
A(t, E) = 4 wpAke PED tth + kT <& <tt+kT,k>1 
oe 0 otherwise 


and for ti < t < T by 


wp Ake“ OlE-OT +10) th HkT <E <t+kT,k>1l 
h(t, £) = 


ti<t<T otherwise . 


14.3.2 Time-Varying Transfer Function 


While the circuit is fully characterised by the above time-varying impulse response, 
its filtering characteristics are best understood by analysing its time-varying fre- 
quency response. This will allow us to easily obtain the output signal when the 
circuit is driven by a tone. 

We compute the time-varying transfer function A(t, w) by Fourier transforming 
h(t, €). For 0 < t < tı we have 


t a t+kT 
h(t, w) = f everomterag + 5 / an Ake EIT) eE de 
0 k=l t +kT 


The terms in the right summation are powers of the summation variable k multiplied 
by a constant and reminds of a geometric series with a missing first term. As a first 
step we therefore add the missing term by adjusting the limits of the first integral 


a 33 t+kT 
it, w) = — | age TEE dé + 5 wy Ak eT f e O08 108 gE s, 
= i t—t1+kT 
Evaluating the integrals and simplifying we find 
—(w+Jo) (t— + 
1 — e oto) (t=) Leste eed elmotyo)n _ LS Ate jT. 
@ 
Lael ses Lis A 


Performing the summation of the geometric series we finally obtain 


x T= e7 ot Jo) (tt) (eereton a 1)e™ Coto) 
Ad, o) = 4 l 
0<t <t 1+ J A (1 +J 2) e e7 oT e=) 


(14.23) 
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For tı < t < T the time-varying frequency response is given by 


a t+kT 
ht, w) = > wg Ake Cole EF 41-1 Ne I08 de i 
k=0, t HKT 


This is again a geometric series and proceeding as above we obtain 


N (een = ee 
h(t, œ) = 


. 14.24 
ti<t<T (1 + J) = e—oT e—oot) ( ) 


14.3.3 Selectivity 


With htt, œ) the output of the circuit when driven by x(t) = cos(wt) is immediately 
obtained : 
y(t) = RAlt, we}. 


The output is shown in Fig. 14.17 for two values of the input frequency and N = 4. 
During the time intervals ti + kT < t < (k+1)7,k € Z the output is constant and 
assumes the value 


ot —W okT 
y(t) = A(t, w)e!'} = nf (e le oes | 


(1+ 2) — e JOT eat) 


where we used the periodicity in time of h (t, w) and the previously computed expres- 
sion valid for ti < t < T 


h(t, w) =h(t —kT,o). 
ti<ct—kT <T 


These values are the output sample values of the sampling mixer. Let’s denote them 
by y[k] and set w = nw; + Aw,n € Z with w = 2r /T and Aw < w,/2. Then the 
above expression becomes 


Ik] = Rhan + Avei) 


(e (nos+Ao)ti _ e720 je AokT (14.25) 
=R 


These are the samples of a sinusoidal with angular frequency Aw and amplitude 


(e7 (nøos+Aw)ti __ en) | 


| (1 +J reætâo)(] — e~J AoT e—o0t1) 
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Fig. 14.17 Sampling mixer a 1 
output Vo when driven by 
cos(@t) with 

N =4, w =0.57.a 

w = 1.0lws. b w = 1.20, 
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The fact that the output samples correspond to samples of a sinusoidal with a fre- 
quency independent of n is a manifestation of aliasing inherent in every sampling 
process. The interesting aspect of the sampling mixer is the fact that only samples 
of tones with a frequency very close to nw, have a significant amplitude while the 
ones of signals with frequencies at a distance larger than approximately wo/N from 
nw, are attenuated. This effect is due to the factor 


Aw 2x 20 


1— e JA 720l =] = e7721 os e N os 


in the denominator of the above expression. For wo &K Næs the last exponential 
on the right is only slightly smaller than 1. Therefore, for Aw < wo/N this factor 
becomes small thereby boosting the value of the samples around those frequencies 
(see Fig. 14.18). From this we conclude that the sampling mixer not only behaves as 
a sample and hold, but it also acts as a highly selective (large quality factor) filter 
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Fig. 14.18 Equivalent 
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around the sampling frequency and its harmonics. The achievable selectivity is much 
higher than the one of LTI RLC filters integrable in a standard CMOS technology. 


14.3.4 Even Harmonic Response Suppression 


The response around the harmonics is generally undesired and can be suppressed by 
using weighted sums of the N outputs as discussed in Example 12.10. In the following 
we assume N even and investigate the possibility to suppress the responses at even 
harmonics by making use of the sample values on the capacitors as opposed to the 
full waveforms. Consider the sample value on capacitor i = N /2 


Vy a(t) = RÉ —T/2, wel} 
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Denoting the sample value held by the capacitor during the interval tı + tı N/2 + 
kT <t <T +t,N/2+kT by yn/2[k] and using the periodicity in time of h(t, œ) 
as before we obtain 


(es ms ee eee) ; 
Jo 
(CEEI eT emn) | 
(eren = eh) 
=y 
l (1 + J2) — e—oT e— aot) 


= Mf hem (co)erK7 e172} 


yy 2[k] = nf 


QlOUT+T/2) | 


If the frequency of the signal is close to the nth harmonic of the sampling frequency 
w = nws + Aw, Aw « ws the last expression becomes 


yn 2k] = N fhos $ Awel AT 1yr | l 
The difference between sample values on capacitor Co and Cy,2 is thus given by 
yik] — yw2lk] = Nf hes (ne + Awe T] — prezi] 
Under the assumption Aw < w, the right most exponential is close to 1 so that 


0 n even 
yik] — yupik] ~ a T 

For N = 4 the four output samples can be combined in pairs forming signals 
corresponding to the in-phase (V;[k] = Vo[k] — V2[k]) and quadrature (Vg[k] = 
V,[k] — V3[k]) output of a quadrature mixer. 

Before concluding this section we note that a loss of charge from the capacitor 
during the hold phase results in a lower boost of the samples around the frequencies 
nws, n € Z. A loss of charge could be caused for example by a finite load resistance 
or by a switched-capacitor circuit following the sampling mixer. The reduction in the 
magnitude of the samples comes from the fact that the value of A appearing in the 
definition of h(t, €) will become smaller and as a consequence the boosting factor 


{= e JACT A 


in the denominator of A(t, œw) will not become as small as calculated above. 
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14.4 N-Path Filters 


The block diagram of a general N-path filter is shown in Fig. 14.19 and can be 
thought of as the cascade of an N-path receiver and an N-path transmitter (com- 
pare with Example 12.10). Among other things they permit to implement transfer 
functions that, under suitable assumptions, mimic the ones of LTI networks that are 
difficult or not manufacturable with RLC elements due to the limited range of actu- 
ally implementable values or due to limitations in their quality (quality factor). The 
time-varying transfer function of a general N-path filter can be analysed using the 
same methods used to analyse the N-path receiver of Example 12.10. Here instead 
of the general case we analyse a concrete implementation that shows some useful 
applications. 

In the following we analyse the simple case in which the N LTT subsystems are 
simple shunt capacitors and where the periodic input and output functions are equal 
switching functions. Under these conditions, when the switches of path k are closed, 
the upper plate of the kth capacitor is simultaneously connected to the input as to 
the output and we obtain the circuit shown in Fig. 14.14 where now the output is 
constituted by the node labeled V+. 


14.4.1 Time-Varying Frequency Response 


During clock phase k,0 < k < N — 1, during which the switch S% is closed, the 
voltage Vy is equal to the voltage V;. For this reason we can express Vy in terms 
of the time-varying frequency response that we obtained in the previous section. In 
particular, when the input is a complex tone e/® the output is given by 


Fig. 14.19 Block diagram h(t) b(t) 
of a generic N-path filter 
L-T) b(t-T x) 
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N-1 
Ve (t) =e!" S smt- kt), @)1,(t — kti mod T) 
k=0 


with 


10<t<t 
1,@= f 
0 otherwise 


and where we denoted the time-varying frequency response of the sampling mixer 
by hsm to avoid confusion with the one of the whole N-path filter that we’ll denote 
by h. For 0 < t < T, using (14.23) the frequency response of the filter is thus given 


by 


i¢.oj=— se 
,o) = 
1+ ta 1+ ove 
felt = 1) N-1 
+ e7 @o+jo)t—kti)] t- kt). 
(1 + J2) — = = 1 ( 1) 


As the time-varying frequency response is J -periodic, we can expand it in a Fourier 
series. The nth Fourier coefficient of the summation on the right is 


T N-1 

1 

an = al eT ot sone kt) 4 = kt)e J" dt 

F 
0 k=0 

i N-i KDT /N 

= F elotsoykt e leots(@tnes)|t qp 

k=0 kT /N 


1 1 — e loots @tnes) It NOI 


= T w+ Jl +nows) 


—ynaskt 
k=0 
The last summation is zero unless n is a multiple of N in which case it evaluates 


to N 


T wtyj(w+na,;) 


—e log ty (@tnas Jt} 
NI ee n=Nm,me Z 
an = : 
0 otherwise. 


The time-varying frequency response of the filter is therefore given by 


A(t, o) = >D hy(w)el"" (14.26) 
n=...,—4,0,4,... 
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with 


1 —eot sot (etzo -_ 1) 


lJ + (1+ J) — e JOT gent) 


or, after some simplification 


hy(@) = 


1 (ee T=) E 1)(1 = e foots (otroN) 
| t1 w0 (1 = e107 e=% ) (1 +j etnos) 


w0 


+72 | . (14.27) 
The last term of h, (œw) includes the same factor that we discussed in the analysis of 
the sampling mixer and responsible, under the condition wot; « 1, for boosting the 
response of the circuit at frequencies w = kw, + Aow, k € Z, Aw < wo/N. There- 
fore, for wot; < 1 the transfer function 4o(@) represents a highly selective band-pass 
filter with pass bands centered at kw, with k Æ Nm, m € Z (see Fig. 14.20). 


Fig. 14.20 N-path filter 
transfer functions magnitude 
for N = 4, w = 0.57. 

an = 0. b Detail around 
S/fs © 1 forn = —4, 0, 4 
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The transfer functions h, (œ), n Æ 0 are also highly selective, but they also intro- 
duce a shift in frequency. In particular an input tone at w — nw, passing through 
h,(@) results in an output tone at w which will overlap with the response of ho(@) 
to an input tone at w. Therefore, if this N-path filter is used in a receiver to suppress 
interfering signals, while it possess undesired pass-bands, the closest frequency of 
an interfering signal that at the output of the circuit will overlap in frequency with 
the wanted signal is Næs. The magnitude of a few transfer functions producing an 
output tone at w are shown in Fig. 14.20. 


14.4.2 Selectivity 


By applying some approximations to ho valid in the vicinity of ws we can obtain a 
transfer function that can be implemented with fixed RLC components. This will 
allow us to quantify the selectivity of the filter in terms of standard metrics. 

As a first step we set again w = ws + Awanduse Aw < ws; to make the following 
approximation 


1 1 1 
ee- _ 1)\) x el @sti/2 (eJ@sti/2 _ @—Josti/2 
1+ ja tiwo (( )) JOsty ( ) 
yn sin(a/N) 


m/N 


=elt/ 


Similarly, using in addition wọ K ws 


(1 — en lots@.+A0)IH) mw eum/n __2J Sin(xr/N) 
(is e7 @+Ao)T gents) (1 d J”) (JA@T + wot) Jo 
sin(z/N) 


(l+ yN22)x/N- 


Ry e Jt/N 


Finally, using these approximations and noting that the first summand in ho(@) is 
small compared to the second we obtain 


2 1 (14.28) 


ho(@) © ( ; 
o(@) a/N / 14+ jN42 
The impedance of a parallel LTI RLC resonator with a resonance frequency of 
Ws is given by 


ra os q 
L a 


Os 
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Fig. 14.21 Model for the 
N-path filter ho(@) valid 
around w, + 


JO =z 3a fe 


with q the quality factor of the resonator and R, the impedance at resonance. Around 
the resonance frequency it can be approximated by 


R, 


Zlo, +40) x —. 
1+ J24% 


The transfer function to Vy around the resonance frequency of the circuit shown in 
Fig. 14.21 is therefore given by 


R, 1 7 1 R, 


R+R, 1+ age e 


and has the same form as the approximation of ho(œw) given in (14.28). The two are 
equal if 
R, 
R+R, 
R Ao N 


2q = 
R+ R, œs Wo 


a sin(z/N)\2 
K := = ) : 


This shows that around ws ho(œw) can be modelled as a parallel resonator with reso- 
nance frequency w; and characterised by 


K Nua 
R, = R — y qd = TT - 
1-K (1 — K)a@oT 

The transfer function of this model is compared with the exact ho (œw) in Fig. 14.22. 
For N = 4, œT = 0.5 the quality factor has a value close to 133. For comparison, 
the highest resonance quality factor implementable at RF frequencies with inductors 
and capacitors available in standard CMOS technologies is in the range of 20, with 

typical values substantially lower than this. 
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Fig. 14.22 Comparison 0 J : : : 
between the N -path filter : ; : T o2z D 
transfer function using the : 
RLC model valid around the 
sampling frequency and the 
transfer function ho for 


N = 4 and w = 0.5T 
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Appendix A 
Signal-Flow Graphs 


Signal-flow graphs (SFGs) are a graphical way to represent systems. While this is 
true of block diagrams, the conventions used with the former make them simpler and, 
for many purposes, more powerful than the latter. SFGs are sometimes used with 
nonlinear systems, but are most useful to manipulate and transform linear systems. 
In this appendix we first review some of the most useful transformations applicable 
to SFGs used to describe linear systems, so called linear signal-flow graphs. We then 
review their more limited use with general nonlinear systems. Finally, we show how 
all the rules valid for linear SFGs can be extended to weakly-nonlinear time-invariant 
systems. 


A.1 Linear Signal-Flow Graphs 


A.1.1 Construction Rules 


Signal-flow graphs are directed graphs. Each node represents a variable, in our case 
a signal. Signals flow along the branches (or edges) of the graph in the indicated 
direction. Each branch is labelled by the transmission factor of the branch. A signal 
flowing along a branch is composed with the branch transmission factor: in the 
time domain composition is effected by using the convolution product, while in the 
Laplace or frequency domain by using standard multiplication. The transmission 
factors in the time domain representation of a system are impulse responses while in 
the Laplace representation are transfer functions. 

All signals entering a node through branches are summed at that node. In other 
words, the value of the variable represented by a node is the sum of the entering sig- 
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Fig. A.1 Signal-flow graph 
corresponding to Eq. (A.1) 


Xo xy X2 X3 


nals. The value of the variable represented by any node is transmitted to all branches 
leaving the node. Nodes without incoming branches are source nodes. Nodes with 
only incoming branches are sink nodes. 

Using the above rules SFGs can be used to represent systems of linear algebraic 
and convolution equations. The usefulness of SFGs comes from the fact that their 
graphical nature helps clarify the relation between variables and the existence of 
feedback loops. For example, the system of equations 


xı = axo + dx + ex2 
x = bx, (A.1) 


X3 = CX2 


can be represented by the SFG shown in Fig. A.1. 

Here and in the following, for simplicity of notation, we will use single letters to 
represent transmission factors and suppress all product symbols. Before proceeding 
discussing more aspects of SFGs it’s convenient to introduce some specific termi- 
nology [37]. 


Intermediate node: A node with incoming and outgoing branches. 

Path: A connection from a starting node to an end node by a continuous, uni- 
directional succession of branches all of which are traversed along the branch 
direction. 

Open path: Any path not touching a node more than once. 

Loop: A path starting and ending at the same node. All other nodes are touched at 
most once. 

Self-loop: A loop touching a single node. 

Non-touching loops: Loops without common nodes. 

Path-/Loop-gain: The product of the transmission factors of the branches forming 
the path resp. the loop. 


A.1.2 Reduction Rules 


Many of the algebraic manipulations performed in solving linear equations can be 
translated in reduction rules for SFGs [37]. These are graphical rules to transform a 
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given SFG in an equivalent, but simpler form. Hereafter we list the most useful rules 
together with the algebraic equations proving the equivalence. 


1. Parallel transformation: 


b 
ee ax b 
o_—__»>___—_ 
Xo X1 Xo x] 
xı = axo + bxo xı = (a + b)xo 
2. Cascade transformation: 
a b = ab 
$2 $+. o> 
Xo X1 x2 Xo x2 
Xo = bx, X1 = AXo X2 = abxo 
3. Star to mesh transformation: 
X1 X1 
Ab ab cb 
a c = 
. > + + . 
Xo X4 X2 xo X2 
yd ad cd 
X3 X3 
xı = bx4 xı = abxo + cbx2 
x3 = dx4 x3 = adxo + cdx2 


X4 = axo + CX2 


This rule can also be applied in the case in which a transmission factor is zero 
and in the case in which some nodes coincide. 
4. Node elimination: 


c d 
ca cd bd 
> > 
Xo xX} x2 as 
a 
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Xo = CX Xo = caxo + cdx2 
X2 = bx, x2 = baxo + bdx2 


xı = axo + dx2 


This is a special case of the star to mesh rule. 
5. Shifting start point: 


c cla 
a b = a b 
> - > e > > 
Xo xı x2 Xo X1 x2 
X1 = aXo xX, = AXo 
x2 = bx; + cxo x = (b+ c/a)x, 
6. Shifting end point: 
c c/b 
a b = a b 
> - > > > . 
Xo xX x2 Xo xí x2 
X1 = axo xi = (a + c/b)xo 
x2 = bx; + cxo x = bx} 


Note that this transformation changes the variable x; into a new one x; Æ x1. In 
spite of this the total transmission factor from x9 to x2 remains unaffected. 

7. Path inversion: A branch can only be inverted if it starts at a source node (no 
incoming branches) as the branch with transmission factor b in the illustration. 


xo X] x2 Xo x] x2 
e > 4 . 
—a/b —c/b 
a b c = 1/b 
X4 d X3 X4 d X3 
a c 1 
x3 = axo + bx, + cx2 xy= -7% = p”? + AÁ 


Note that application of this rule moves a source of the graph. Consecutive appli- 
cation of the rule allows inverting a path from a source to an arbitrary node. 
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8. Self-loop elimination: 


Xp = CX] X0 = CX] 
x = bx, x2 = bx 
a d 
xı = axo + dx + ex x= xı + x2 
l-e l-e 


In the first few transformation rules only summations and multiplication of trans- 
mission factors occur. Therefore, if the initial transmission factors correspond to 
transfer functions (or impulse responses) of stable systems, so do the ones of the 
transformed graph. This is not necessarily the case with later rules entailing divi- 
sions. 


A.1.3 Mason’s Rule 


One of the strength of SFGs is the fact that Mason’s rule allow writing the transfer 
function (or impulse response) from a source node to another node by inspection 
[38]. Before stating the general rule we first focus on graphs with a single open path 
from a source node x, to another node x; and where the path touches every loop in 
the graph (that means that it has at lease a node in common with every loop). For 
this case the transmission is given by 


with P,; the gain of the open path and A the graph determinant (or system determi- 
nant). If we denote the loop gain of loop i by L;, the graph determinant is defined 


by 
A:= 1- SOL; + DOLL; — SO LL iL ++ : 


the first summation being over all loops, the second over all pairs of non-touching 
loops, the third over all triplets of non-touching loops, etc. 

As an example consider the SFG depicted in Fig. A.2. From x9 to x4 there is a 
single path touching all three loops. The transmission To4 is 
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Fig. A.2 Single path f 
multi-loop SFG 
e Ly g 
a Li L 
s- > > > > 
Xo x b 2 c X3 d x4 
Poa 
To4 = 
1 — (Lı + L2 + L3) + LiL 
abcd 


~ 1— (be + dg + bef) + bedg ` 


Consider now the case of two open paths from a source node to another node. Since 
the graph represents a linear system, the output is the sum of the two contributions 


P P. 
P a 2 
A A 
where the individual contributions are calculated as for the single path case, with P; 


the gain of path j and A; the determinant associated with path j, that is, calculated 
discarding the loops not touched by path j. This expression can be rewritten as 


_ PA:+ PA; 
A 


T ; A = A142. 
The denominator of this expression A is the determinant of the full SFG and each 
path transmission P; is multiplied by the determinant of that part of the graph with 
no nodes in common with the path. Generalising this expression to more paths we 
obtain Mason’s rule 

. Aj 


A 


T (A.2) 


where here, differently from above, A ; denotes the co-factor of path k defined as the 
determinant of that part of the graph which doesn’t have any node in common with 
path j. A formal proof can be found in [39]. 


A.2 Nonlinear Systems 


The use of signal-flow graphs with nonlinear systems is more limited. One (and the 
first) application is in studying the smallest number of implicit equations in a system 
of nonlinear equations [38]. In this context a branch simply represents a dependence 
of the variable represented by the node pointed to by the branch in question by the 
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Fig. A.3 Example 
signal-flow graph 
representing Eq. (A.3) 


XO xX] X2 


variable represented by the node where the branch originates. For example the set of 
nonlinear equations 
x1 = fı (xo, x2) 


(A.3) 
X2 = fo(Xo, X1, X2) 
is represented as in Fig. A.3. In this setting there is no concept of branch transmis- 
sion. Also, parallel branches between nodes make no sense. In spite of this one can 
adapt those reduction rules based only on variable substitutions. For example, in the 
above example we can remove x; by a simple substitution obtaining a single implicit 
equation 7 
x2 = fr(xo, fio, x2), x2) = f (Xo, x2). 


A second way in which signal-flow graphs are used in conjunction with nonlinear 
systems consists in retaining most construction rules of linear SFGs, but allow the 
use of nonlinear functions instead of transmission factors. This approach is popular 
for example in the study of neural networks [40]. As an example Fig. A.4 shows the 
signal-flow graph of Rosenblatt’s perceptron. In this model the input branches posses 
transmission factors labelled w , but the branch connecting to the output y represents 
the application of the nonlinear function g(.) to the signal v. The function g(.) is 
called activation function and in this context has a monotonic, limiting character. 

This method is useful to analyse graphs without feedback loops. There is no 
equivalent of Mason’s rule applicable in the presence of feedback loops. 


Fig. A.4 Signal-flow graph 
of a Perceptron 
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A.3 Weakly-Nonlinear Systems 


The equations describing weakly-nonlinear time-invariant systems can be solved in 
an iterative way as described in Chap. 9. The component of order k of the Volterra 
series representation of a signal is calculated by solving a linear system of equations 
with products and powers of terms of order lower than k acting as sources. The 
signal-flow graph of a weakly-nonlinear system can thus be constructed as a linear 
SFG representing the linear part of the equations and where products and powers of 
signals are added as source nodes. In this way, all features of linear SFGs can be 
used, including Mason’s rule. 

One starts by calculating the first order response due to the external source (which 
is of order one) as with linear systems. With it one can then compute the value of 
the sources of second order. More generally, with the responses of order up to k — 1 
one can compute the sources of order k. The latter drive a linear system. The transfer 
function of order k can often be written by inspection using Mason’s rule as follows 


1. Compute the linear transfer function from the source to the node of interest. 
2. Replace the single Laplace variable s of the linear transfer function by the sum 
Sı +--+ + sg (see Sect. 10.1). 


The procedure works in the time domain as well. One has simply to use impulse 
responses and the convolution product, and adapt the second step. 

The following example illustrates the use of SFGs to analyse WNTI systems. In 
Sect. 10.2 we use SFGs to illuminate the effect of distortion within a feedback loop. 


Example A.1: Driven Pendulum 


In this example we analyse the pendulum shown in Fig. A.5. A weight of mass m 
at the end of a massless rod is suspended from a pivot in Earth’s gravitational field. 
We model the weight as a point mass whose position is specified by the angle ¢. 
We assume the presence of some viscous fluid causing a drag proportional to the 
velocity of the weight. A motor drives the pendulum exerting a periodic torque M. 
The equation governing the dynamics of the system can be obtained from Newton’s 
second law 
2 b g. 1 
D°¢+ —D¢+ = sin ġ = —— K cos(ar) 
Im l Pm 

with M(t) = K cos(at). 

For convenience, we re-express the coefficients of the equation in terms of the 
standard parameters for systems of second order and normalise the amplitude of the 
torque 


m K 
oo = ff q=Vele: Aim es 
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Fig. A.5 Driven pendumum 
in Earth’s gravitational field i M(t) 


Further, using the generalised velocity a = Dø, expanding the sin function in its 


Taylor series 
(Der 
ae ay oe 


n=1,3,5,... 


and defining the input signal x as 
x(t) = A cos(æœt), 


the equation can be written as a system of two convolution equations relating œ and 


$ 


a = Dô x p 

1 1 (1%? (A.4) 
ġġ = —(— Dê + — ô) xa +x + — g". 

o wq 3 n! 


n=3 55; 2 


A signal-flow graph representing these equations is shown in Fig. A.6. 

We are interested in the steady-state oscillation when the pendulum is driven 
with a frequency equal to wọ. For this reason it’s convenient to work with transfer 
functions. The various transfer functions can be determined by inspection from the 
SFG using Mason’s rule, using the transform of each transmission factor and applying 
the transform of an impulse as input. The first order transfer function is 


Fig. A.6 Signal-flow graph -(Lps+— 6 
corresponding to Eq. (A.4) (22 ig ) 


(-1)"-b?2 
2in=3,5,...— nl 


n 
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1 
Hı (sı) = 2 ls 
a+ za l 
o 0 


The linear part of the response is thus 
Q(t) = R{ A (ja) Ae!" = Ag sin(@of) . 


From the signal-flow graph it’s immediately apparent that the transmission from 
the source representing the nonlinear terms to the node ¢ is the same as the one from 
x to ġ. Using the fact that the response of a linear system to a source of order k is 
obtained by multiplying the Laplace transform of the source by the linear transfer 
function of the system and by replacing in the latter sı with the sum sı +--+ + Sk 
(see Sect. 10.1), the transfer function of order k from the “nonlinear” source of order 
k is therefore 

Fy (sı +--+ + Sx). 


There is no second order power in the source terms. Hence, the second order 
transfer function H3 is identically zero. The third order transfer function is obtained 
by substituting Hı + A in the “nonlinear source” terms ø” and retaining only terms 
of third order. This gives 


1 
gu (51) My (52) My (83) , 


hence ; 
H3(s1, 82,53) = Fae (81). Aj (2) Ay (83) A (s1 + s2 +53). 


Note that H; is already symmetric. 
The contribution of third order to the steady-stage oscillation at wo is given by the 
frequency mix m = (1, 2) 


A> 3! 
$3,m(t) = zg tt Ai sen Ai yer) Hi (yor) Goe) 


qt 
Sara COS(@pof) . 
8 
The fourth order transfer function is identically zero. The fifth order is obtained 


by inserting Hı + ---+ H; into the “nonlinear source” terms ġ” and retaining only 
terms of fifth order, giving 


The fifth order transfer function is then found by multiplying it by Hı (s1 +---+ 55) 
giving 
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1 
H5(51, 82, 53, $4, 55) = È [Ay (s1) Hı (s2) H3 (s3, S4, 95) sym 


1 
= zm (81) Hı (s2) Hı (s3) Hı (s4) Hı colme + so + 83 + S4 + 85). 


The fifth order component at wọ produced by the frequency mix m = (2, 3) is 


Ysm(t) = A® SEL R{H5(— Jo, — Jo, Jo, Jo, J)e” } 
mn 24 213! ’ ’ ’ ’ + 


There are in total 10 ways to fill the slots of [ae ® H3] 


?2Jsym 
of the tuple (—j@o, —j@o, J@o, J@0, J@o). One possibility is to put the negative 
frequencies in the first two slots giving 


with permutations 


5 
Ay (—Jj@o) Ay (— J @0) H3(J@0, Jæ, J@o) = TET 
q 


There are 3 cases in which both Hs are applied to positive frequencies 
6 


q 
Hı (Jo) Hı (Joo) H3(J@o, —J @0, — Joo) = = 


and 6 in which the two Hs appear one with a positive frequency and the other with 
a negative frequency 


6 
A (—j@o0) A (J 0) A3(J oo, J@o, —J@0) = a ; 
Summing all terms we obtain 
1 IG q° q’ 
Hp, =y zo sm, rn) = [4 Jio: 
5(—J@0, — J0, Jo; Jo, J@o) 10 128 — 5) 7 + J 359 (-J9) 


The first, third and fifth order approximations that we have found are compared to 
a numerical solution of the differential equation for g = 7 and A = 0.1 in Fig. A.7. 
Even though we didn’t took into account the harmonics produced by nonlinear terms, 
the fifth order approximation at wọ gives a fairly accurate approximation up to a swing 
of ca. 40°. A linear model predicts that the pendulum dynamics settles in such a way 
that the peak of the applied torque occurs when the pendulum is in the vertical 
position (@ = 0°). The more accurate model shows that at moderate swing levels 
this is not the case. The peak torque happens before the pendulum passes through 
the vertical position. It’s also interesting to note that the main phase correcting term 
is 3 Whose phasor is perpendicular to the one of the linear term. However, being 
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Fig. A.7 Comparison of the 
steady-stage solution of the 
driven pendumum obtained 
by numerical integration 
with the Volterra series of 
order one, three and five. 
q=7,A=0.1 


(rad) 


49 49.2 49.4 49.6 49.8 50 
t/T (-) 


at 90°, @3 m is unable to produce a good amplitude correction. For that we need to 
consider at least one more term. 
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Almost everywhere, 24 
Almost periodic, 122, 161 
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Amplitude-modulation, 198 D’, 23 
D' (T), 52 
D’ ,43 
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Ba T 
Blocking level, 205 L' ? 
Bounded, 87 Prig 122 
locally, 262 Per 
Dp, sym R ), 142 
Dei, 50 
C D(T), 
Capacitance, 209 Degeneration impedance, 244 
Cauchy principal value, 26 6,9, 11, 13, 26 
Cauchy product, 308 derivative, 30 
Chaos, 7 Fourier transform, 60 
CIM3, 335 Dense, 24 
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Differential equation 
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Dirac 
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delta, see 6 
impulse, see 5 
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Direct product, 140 
Direct sum, 139 
Distribution, 11, 22 
approximation, 48, 49 
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convergence, 24 
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partial derivative, 29 
periodic, 49, 52 
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Do, 315 
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Doppler-spread function, 276 
DR, 204 
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properties, 125 
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Floquet representation, 287 
Formal power series, 141 
Fourier series, 68 
coefficients, 68 
n-dimensional, 73 
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Fourier transform, 55, 59 
inverse, 55 
n-dimensional, 70 
properties, 65 
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Frequency mix, 160 
order, 160 
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time-varying nonlinear, 306 
Fubini’s theorem, 40 
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Fundamental kernel, 267 
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Laplace transform, 76 
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semi simple, 129 
state transition, 259 
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image response, 329 
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Numerical simulations, 12 
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Om, 92 


Operatinal calculus, 13 
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Phasor, 161 
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Polynomial 
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Quadrature demodulator, 294 
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Rayleigh distribution, 327 
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Series 
absolutely convergent, 307 
Fourier, 68 
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